Implementing chemistry platform for OpenPHACTS

Independent Consultant um Science Data Software, LLC
24. Mar 2016
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
1 von 33

Más contenido relacionado

Was ist angesagt?

Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry
Crowdsourcing, Collaborations and Text-Mining in a World of Open Chemistry US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
Small Molecules in Big Data - Analytica MunichSmall Molecules in Big Data - Analytica Munich
Small Molecules in Big Data - Analytica MunichEmma Schymanski
Building a semantic chemistry platform with the royal society of chemistryBuilding a semantic chemistry platform with the royal society of chemistry
Building a semantic chemistry platform with the royal society of chemistryValery Tkachenko
Why Chemistry and the Web Will Benefit from a ChemSpiderWhy Chemistry and the Web Will Benefit from a ChemSpider
Why Chemistry and the Web Will Benefit from a ChemSpiderUS Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
The royal society of chemistry and its adoption of semantic web technologies ...The royal society of chemistry and its adoption of semantic web technologies ...
The royal society of chemistry and its adoption of semantic web technologies ...Valery Tkachenko
SETAC Rome Non-Target Screening For Chemical DiscoverySETAC Rome Non-Target Screening For Chemical Discovery
SETAC Rome Non-Target Screening For Chemical DiscoveryEmma Schymanski

Was ist angesagt?(20)

Destacado

OpenPHACTS - Chemistry Platform Update and LearningsOpenPHACTS - Chemistry Platform Update and Learnings
OpenPHACTS - Chemistry Platform Update and LearningsValery Tkachenko
Opportunities in chemical structure standardizationOpportunities in chemical structure standardization
Opportunities in chemical structure standardizationValery Tkachenko
Experiences and adventures with no sql and its applications to cheminformatic...Experiences and adventures with no sql and its applications to cheminformatic...
Experiences and adventures with no sql and its applications to cheminformatic...Valery Tkachenko
Text mining to produce large chemistry datasets for community accessText mining to produce large chemistry datasets for community access
Text mining to produce large chemistry datasets for community accessValery Tkachenko
Not just another reaction databaseNot just another reaction database
Not just another reaction databaseValery Tkachenko
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3Marilyn Gardner Milton: Advanced Law School Chat Pt. 3
Marilyn Gardner Milton: Advanced Law School Chat Pt. 3Marilyn Gardner Milton MA

Similar a Implementing chemistry platform for OpenPHACTS

Open PHACTS (Sept 2013) EBI Industry ProgrammeOpen PHACTS (Sept 2013) EBI Industry Programme
Open PHACTS (Sept 2013) EBI Industry ProgrammeSciBite Limited
Opening up pharmacological space, the OPEN PHACTs apiOpening up pharmacological space, the OPEN PHACTs api
Opening up pharmacological space, the OPEN PHACTs apiChris Evelo
Open chemistry registry and mapping platform based on open source cheminforma...Open chemistry registry and mapping platform based on open source cheminforma...
Open chemistry registry and mapping platform based on open source cheminforma...Valery Tkachenko
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS FoundationPistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS Foundation
Pistoia Alliance European Conference 2015 - Nick Lynch / Open PHACTS FoundationPistoia Alliance
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)
BDE SC1 Workshop 3 - Open PHACTS Pilot (Kiera McNeice)BigData_Europe
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAGopen_phacts

Similar a Implementing chemistry platform for OpenPHACTS(20)

Más de Valery Tkachenko

Evolution of public chemistry databases: past and the futureEvolution of public chemistry databases: past and the future
Evolution of public chemistry databases: past and the futureValery Tkachenko
In silico design of new functional materialsIn silico design of new functional materials
In silico design of new functional materialsValery Tkachenko
Metal-organic frameworks: from database to supramolecular effects in complexa...Metal-organic frameworks: from database to supramolecular effects in complexa...
Metal-organic frameworks: from database to supramolecular effects in complexa...Valery Tkachenko
Abstract recommendation system: beyond word-level representationsAbstract recommendation system: beyond word-level representations
Abstract recommendation system: beyond word-level representationsValery Tkachenko
Machine learning methods for chemical properties and toxicity based endpointsMachine learning methods for chemical properties and toxicity based endpoints
Machine learning methods for chemical properties and toxicity based endpointsValery Tkachenko
Chemical workflows supporting automated research data collectionChemical workflows supporting automated research data collection
Chemical workflows supporting automated research data collectionValery Tkachenko

Más de Valery Tkachenko(20)

Último

Cormas RMoDCormas RMoD
Cormas RMoDOleksandr Zaitsev
Tissue Plasminogen Activator.pptxTissue Plasminogen Activator.pptx
Tissue Plasminogen Activator.pptxAnieshR3
CARBOHYDRATE CLASSIFICATION.pptxCARBOHYDRATE CLASSIFICATION.pptx
CARBOHYDRATE CLASSIFICATION.pptxDrDharmeshTewari
A tilted dark halo origin of the Galactic disk warp and flareA tilted dark halo origin of the Galactic disk warp and flare
A tilted dark halo origin of the Galactic disk warp and flareSérgio Sacani
Self-Organisation Programming: a Functional Reactive Macro Approach (FRASP) [...Self-Organisation Programming: a Functional Reactive Macro Approach (FRASP) [...
Self-Organisation Programming: a Functional Reactive Macro Approach (FRASP) [...Roberto Casadei
Astronomaly at Scale: Searching for Anomalies Amongst 4 Million GalaxiesAstronomaly at Scale: Searching for Anomalies Amongst 4 Million Galaxies
Astronomaly at Scale: Searching for Anomalies Amongst 4 Million GalaxiesSérgio Sacani

Implementing chemistry platform for OpenPHACTS

Hinweis der Redaktion

  1. Remember this, some of these questions are easier to answer than others
  2. Using available public data is critical to drug discovery
  3. 10 Can go get everything Open PHACTS not a repo of the world, specific sources
  4. 8
  5. Open PHACTS was developed to support the key questions of drug discovery Business questions have been at the heart of Open PHACTS and have driven the development of the platform Mx/psa, how calculated who did it? Mash up. With your data too, - top layer join together but need them all commercial Data provided by many publishers Originally in many formats: relational, SD files and RDF Worked closely with publishers Data licensing was a major issue Over 5 billion triples – 14 datasets & growing Hosted on beefy hardware; data in memory (aim) Extensive memcaching Pose complex queries to extract data
  6. 4 million full-text patent documents annotated • USPTO, WIPO, EPO – English language • Life-sciences relevant • Patents mapped to SureChEMBL IDs (e.g. EP-1339685-A2) • Title, publication date, classification codes • Compounds mapped to SCHEMBL IDs (e.g. SCHEMBL15064) • Genes mapped to HGNC symbols (e.g. FDFT1) • Diseases mapped to MeSH terms (e.g. D009765)
  7. Db Stds :which ones (later) Access (API). Driven by the API. Acelerate bulding if apps
  8. Open PHACTS discover platform is now supported by a Foundation
  9. Seen a growing usage of the platform both in volume and in registered applications API remains the cornerstone of the delivery
  10. Connected to different consumer groups
  11. Once users get connected to an API, they tend to stick with it. We are listening to this and will have a version independent URL
  12. Here we see the RSC dataset in Open PHACT’s data repository, which is running the open source Artifactory. The repository understands Maven metadata, and also maintain and verifies checksums of data artifacts. We see here it includes the suggested <dependency> setting for using the dataset from a different Maven project – while this would be a bit exotic perhaps, doing so would put the dataset directly on the classloader without any worrying about downloads or file paths. We can see the hierarchy of the dataset on the left – the repository has expanded the archive for us. The .ro folder contains the Research Object manifest, the void file is the Dataset description – the rest is the “actual data”. One power of Maven is the ease of setting up mirroring – the dataset above is actually from a mirror of the Maven repository of the build server in Manchester.
  13. All have probably seen this slide. Want to pick out some of the key changes and tomorrow will here more