SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Downloaden Sie, um offline zu lesen
Visual Programming
Environments for
Science and Business
MITCH MILLER
SCIENTIFIC THINKING
CODE CAMP 2015
SEPTEMBER 19, 2015
Disclaimer
 This talk represents my opinion and personal experience using 2 fine
software systems developed by third parties
 The software systems shown are very complex and have hundreds
of components. I have only worked with a small number.
 Every task shown today can be accomplished in multiple ways. I’m
only showing of those ways.
Overview
 Introduction: first demo
 What is a ‘visual programming environment’
 The two systems we’ll look at today
 What are these systems capable of?
 Second set demos (in-depth)
Demo 1: set-up
 Task: produce report of all compounds registered during January
Visual Programming: informal
definition
 Drag functional components onto canvas to create program
 Configure most components by setting parameters
 Connect components to route data from one to another
 Run and observe data traveling down the lines
Component types
 File I/O
 Read/write text files
 Read/write MS Office documents
 XML
 JSON
 PDF
 Database access
 Connect
 Query
 Update
Component types (continued)
 Web service consumption
 Domain-specific processing
 Chemical structure I/O
 Chemical structure processing and analysis
 Sequence processing
 Extensibility
 Add your own libraries for more sophisticated processing
Component types (continued)
 Visualization
 Graphing
 Statistical calculations
 Scripting
 Tip: aim for brief scripts
 Data transformation
 If/else processing
 Filtering
 Column selection
 And many more…
KNIME
 Originally a production of the University of Konstanz, Germany 2004
 Currently produced by KNIME.com AG, a company in Zurich,
Switzerland
 KNIME stands for KoNstanz Information MinEr
 Pronounced “Nighm”
 A general purpose data analytics platform
 Free version available for download
 For-sale version available with added extensions
KNIME (continued)
 Java based
 Written in Java
 Scripted, extensible in Java
 URL: https://www.knime.org/
Pipeline Pilot
 Developed and sold by BIOVIA, San Diego, CA
 Originally developed by Scitegic, San Diego in 1999
 Designed for scientists to “rapidly create, test and publish scientific
services that automate the process of accessing, analyzing and
reporting scientific data”
(http://accelrys.com/products/collaborative-science/biovia-
pipeline-pilot/)
 Client-server system
 Commercial product
 Extensible using .NET and Java
 Scripted using an original language, ‘PilotScript’
KNIME Terminology
 Components are called “Nodes”
 Programs are “Workflows”
 Reusable sets of Nodes are “Metanodes”
 Groups of related Nodes are “Extensions”
Pipeline Pilot Terminology
 Components are called “Components”
 Programs are “Protocols”
 Reusable sets of Components are “Subprotocols”
 Groups of related Components are “Packages”
 Different protocols can be combined
 One protocol provides initial UI –including a Web form
 A second protocol handles form data processing (‘work protocol’)
Different systems shown today
serve different populations
 KNIME can be used ad hoc on the desktop of a power user. It is also
used by companies in a variety of industries
 Pipeline Pilot is geared towards scientists and is part of an enterprise
system and requires a server installation
Programs can be deployed outside
the development client
 Give users a URL to access your program
 Users of BIOVIA Electronic Lab Notebook and other software can access
Pipeline Pilot protocols outside the Pipeline Pilot UI
 Users access a Web application that shows them the data they’re
looking for in a purpose-built user interface
 The application does not look like the system with which it was built
 For-sale version of KNIME Server provides similar functionality
Server Features
 User access configuration
 Shared data sources
 Automatic jobs
 Etc.
Second demo
 Exploration of data set using KNIME and Pipeline Pilot
 Data set comes from National Cancer Institute (NCI)’s Developmental
Therapeutics Program (DTP)
 Results of laboratory tests for activity against 60 types of human cancer
cell lines
 Data freely available:
https://dtp.cancer.gov/discovery_development/nci-60/default.htm
Additional demos
 Pipeline Pilot Web Port sample
Suggestions for getting started
 Download the KNIME software(knime.org)
 Install on your computer
 Look at the sample workflows
 Start simple; build up
Types of applications
 Reporting
 Data set comparisons
 ETL
 Data Analysis
References
 Scholarly article on KNIME and Pipeline Pilot
 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3414708/
 www.knime.org
 https://www.youtube.com/user/KNIMETV
 http://accelrys.com/products/collaborative-science/biovia-
pipeline-pilot/
 https://dtp.cancer.gov/
Who is your speaker?
 Mitch Miller, Ph.D. in Chemistry and 20+ years of IT experience
 Independent consultant: Scientific Thinking, LLC
 mitch.miller@thinkscience.us
 Some recent projects
 Ongoing custodian of one chemical database implementation for
ChemIDplus project within the National Library of Medicine
 Upgraded 10-year-old Java Servlet lab workflow application to latest
version of JDK, Internet Explorer 11 and implemented enhancements
 Windows service to handle communication between 2 legacy
applications
 Import wizard for chemical array designer
 Merged a set of chemical databases and harmonized data

Weitere ähnliche Inhalte

Ähnlich wie Code camp 2015 visual programming mm

Kallio Chipster Bosc2008
Kallio Chipster Bosc2008Kallio Chipster Bosc2008
Kallio Chipster Bosc2008
bosc_2008
 
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docxRunning Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docx
jeffsrosalyn
 
COMPRO- WEB ALBUM & MOTION ANALYZER
COMPRO- WEB ALBUM  & MOTION ANALYZERCOMPRO- WEB ALBUM  & MOTION ANALYZER
COMPRO- WEB ALBUM & MOTION ANALYZER
Ashish Tanwer
 
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docxSimulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
edgar6wallace88877
 
generic-software-process-models.ppt
generic-software-process-models.pptgeneric-software-process-models.ppt
generic-software-process-models.ppt
Aayush847388
 

Ähnlich wie Code camp 2015 visual programming mm (20)

Kallio Chipster Bosc2008
Kallio Chipster Bosc2008Kallio Chipster Bosc2008
Kallio Chipster Bosc2008
 
Internship msc cs
Internship msc csInternship msc cs
Internship msc cs
 
Crime security.
Crime security.Crime security.
Crime security.
 
Part 2 improving your software development v1.0
Part 2   improving your software development v1.0Part 2   improving your software development v1.0
Part 2 improving your software development v1.0
 
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docxRunning Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docx
 
Documentation
DocumentationDocumentation
Documentation
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications database
 
Ohio LinuxFest: Crash Course in Open Source Cloud Computing
Ohio LinuxFest:  Crash Course in Open Source Cloud ComputingOhio LinuxFest:  Crash Course in Open Source Cloud Computing
Ohio LinuxFest: Crash Course in Open Source Cloud Computing
 
Fast, Secure Deployments with Docker on AWS
Fast, Secure Deployments with Docker on AWSFast, Secure Deployments with Docker on AWS
Fast, Secure Deployments with Docker on AWS
 
COMPRO- WEB ALBUM & MOTION ANALYZER
COMPRO- WEB ALBUM  & MOTION ANALYZERCOMPRO- WEB ALBUM  & MOTION ANALYZER
COMPRO- WEB ALBUM & MOTION ANALYZER
 
Applying Linux to the Civil Infrastructure
Applying Linux to the Civil InfrastructureApplying Linux to the Civil Infrastructure
Applying Linux to the Civil Infrastructure
 
Frequently Used Off Host Developer Toolsl
Frequently Used Off Host Developer ToolslFrequently Used Off Host Developer Toolsl
Frequently Used Off Host Developer Toolsl
 
Linuxcon 2011 Crash Course in Open Source Cloud Computing
Linuxcon 2011   Crash Course in Open Source Cloud ComputingLinuxcon 2011   Crash Course in Open Source Cloud Computing
Linuxcon 2011 Crash Course in Open Source Cloud Computing
 
Nt1320 Unit 6
Nt1320 Unit 6Nt1320 Unit 6
Nt1320 Unit 6
 
Proposal with sdlc
Proposal with sdlcProposal with sdlc
Proposal with sdlc
 
Decoder Open Research Webinar
Decoder Open Research WebinarDecoder Open Research Webinar
Decoder Open Research Webinar
 
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docxSimulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigm
 
Unit Testing Essay
Unit Testing EssayUnit Testing Essay
Unit Testing Essay
 
generic-software-process-models.ppt
generic-software-process-models.pptgeneric-software-process-models.ppt
generic-software-process-models.ppt
 

Kürzlich hochgeladen

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
Earley Information Science
 

Kürzlich hochgeladen (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Code camp 2015 visual programming mm

  • 1. Visual Programming Environments for Science and Business MITCH MILLER SCIENTIFIC THINKING CODE CAMP 2015 SEPTEMBER 19, 2015
  • 2. Disclaimer  This talk represents my opinion and personal experience using 2 fine software systems developed by third parties  The software systems shown are very complex and have hundreds of components. I have only worked with a small number.  Every task shown today can be accomplished in multiple ways. I’m only showing of those ways.
  • 3. Overview  Introduction: first demo  What is a ‘visual programming environment’  The two systems we’ll look at today  What are these systems capable of?  Second set demos (in-depth)
  • 4. Demo 1: set-up  Task: produce report of all compounds registered during January
  • 5. Visual Programming: informal definition  Drag functional components onto canvas to create program  Configure most components by setting parameters  Connect components to route data from one to another  Run and observe data traveling down the lines
  • 6. Component types  File I/O  Read/write text files  Read/write MS Office documents  XML  JSON  PDF  Database access  Connect  Query  Update
  • 7. Component types (continued)  Web service consumption  Domain-specific processing  Chemical structure I/O  Chemical structure processing and analysis  Sequence processing  Extensibility  Add your own libraries for more sophisticated processing
  • 8. Component types (continued)  Visualization  Graphing  Statistical calculations  Scripting  Tip: aim for brief scripts  Data transformation  If/else processing  Filtering  Column selection  And many more…
  • 9. KNIME  Originally a production of the University of Konstanz, Germany 2004  Currently produced by KNIME.com AG, a company in Zurich, Switzerland  KNIME stands for KoNstanz Information MinEr  Pronounced “Nighm”  A general purpose data analytics platform  Free version available for download  For-sale version available with added extensions
  • 10. KNIME (continued)  Java based  Written in Java  Scripted, extensible in Java  URL: https://www.knime.org/
  • 11. Pipeline Pilot  Developed and sold by BIOVIA, San Diego, CA  Originally developed by Scitegic, San Diego in 1999  Designed for scientists to “rapidly create, test and publish scientific services that automate the process of accessing, analyzing and reporting scientific data” (http://accelrys.com/products/collaborative-science/biovia- pipeline-pilot/)  Client-server system  Commercial product  Extensible using .NET and Java  Scripted using an original language, ‘PilotScript’
  • 12. KNIME Terminology  Components are called “Nodes”  Programs are “Workflows”  Reusable sets of Nodes are “Metanodes”  Groups of related Nodes are “Extensions”
  • 13. Pipeline Pilot Terminology  Components are called “Components”  Programs are “Protocols”  Reusable sets of Components are “Subprotocols”  Groups of related Components are “Packages”  Different protocols can be combined  One protocol provides initial UI –including a Web form  A second protocol handles form data processing (‘work protocol’)
  • 14. Different systems shown today serve different populations  KNIME can be used ad hoc on the desktop of a power user. It is also used by companies in a variety of industries  Pipeline Pilot is geared towards scientists and is part of an enterprise system and requires a server installation
  • 15. Programs can be deployed outside the development client  Give users a URL to access your program  Users of BIOVIA Electronic Lab Notebook and other software can access Pipeline Pilot protocols outside the Pipeline Pilot UI  Users access a Web application that shows them the data they’re looking for in a purpose-built user interface  The application does not look like the system with which it was built  For-sale version of KNIME Server provides similar functionality
  • 16. Server Features  User access configuration  Shared data sources  Automatic jobs  Etc.
  • 17. Second demo  Exploration of data set using KNIME and Pipeline Pilot  Data set comes from National Cancer Institute (NCI)’s Developmental Therapeutics Program (DTP)  Results of laboratory tests for activity against 60 types of human cancer cell lines  Data freely available: https://dtp.cancer.gov/discovery_development/nci-60/default.htm
  • 18. Additional demos  Pipeline Pilot Web Port sample
  • 19. Suggestions for getting started  Download the KNIME software(knime.org)  Install on your computer  Look at the sample workflows  Start simple; build up
  • 20. Types of applications  Reporting  Data set comparisons  ETL  Data Analysis
  • 21. References  Scholarly article on KNIME and Pipeline Pilot  https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3414708/  www.knime.org  https://www.youtube.com/user/KNIMETV  http://accelrys.com/products/collaborative-science/biovia- pipeline-pilot/  https://dtp.cancer.gov/
  • 22. Who is your speaker?  Mitch Miller, Ph.D. in Chemistry and 20+ years of IT experience  Independent consultant: Scientific Thinking, LLC  mitch.miller@thinkscience.us  Some recent projects  Ongoing custodian of one chemical database implementation for ChemIDplus project within the National Library of Medicine  Upgraded 10-year-old Java Servlet lab workflow application to latest version of JDK, Internet Explorer 11 and implemented enhancements  Windows service to handle communication between 2 legacy applications  Import wizard for chemical array designer  Merged a set of chemical databases and harmonized data