SlideShare ist ein Scribd-Unternehmen logo
1 von 4
Downloaden Sie, um offline zu lesen
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 14, Issue 2 (Sep. - Oct. 2013), PP 53-56
www.iosrjournals.org
www.iosrjournals.org 53 | Page
Big Data in Bioinformatics & the Era of Cloud Computing
Prakash Nemade1
, Heena Kharche2
1
(Department of Bioinformatics, Maulana Azad National Institute of Technology, Bhopal MP, India)
2
(Department of Computer Science & Engineering, IES-IPS Academy, Indore MP, India)
Abstract: With the recent breakthrough in bioinformatics, demand of more storage space is increasing day by
day. With this exponentially increasing data i.e. the big data of bioinformatics sector, the data needs to be
handled in more flexible and cost effective manner. With this growth in the volume, variety and velocity of data,
cloud computing promises to address big data issues and analysis of challenges of big data in bioinformatics.
Big Data of bioinformatics consisting of various sequences (nucleotide or amino acid) which are now showing
exponential growth because of high throughput experimental technologies. By the use of various bioinformatics
tools demands for large and fast data storage technology is increasing. In this paper, security model is re-
drafted for flexible use of model by consumers and various cloud based services and techniques are coined up to
make an easy approach for implementing big data of bioinformatics using cloud.
Keywords: Big Data, Bio Cloud, Bioinformatics, Cloud, Cloud Computing, Secure Cloud
I. INTRODUCTION
In general bioinformatics can be defined as ―An interdisciplinary field that develops and improves
upon method for storing, retrieving, organizing and analyzing biological data‖ [1].With the emergence of highly
efficient bioinformatics technologies, the volume, variety and velocity of data is increasing with a great pace. As
reported on February 2012, two nanopore sequencing platforms (GridION and MinION) are capable of
delivering ultra-long sequencing reads (~100kb) with additionally higher throughput and much lower cost [2].
With this amount of data, it would be increasingly daunting for small scale industries to invest in a large data
server that will be kept in back room, to store your data. Moreover big servers demands for highly skilled people
to maintain the data which increases their operating costs. At present, a promising solution to address this
challenge is cloud computing, which exploits the full potential of multiple computers and delivers computation
and storage as dynamically allocated virtual resources via the Internet [3].
Bigdata in bioinformatics face various implementation challenges which hinders customer to shift their
data onto the cloud despite of several advantages. Cloud Computing is here to stay, as it is proposed to
transform the way IT is deployed and managed, promising reduced implementation, maintenance costs and
complexity, while accelerating innovation, providing faster time to market, and providing the ability to scale
high-performance applications and infrastructures on demand [4]. Features like Broad network access, resource
pooling, pay as you go, rapid elasticity and on demand access has made cloud computing king of computing.
With the use of various services and concepts of cloud computing one can easily handle the problems of big data
in bioinformatics. With the better use of the services i.e Software as a service (SaaS), Platform as a service
(PaaS) and Infrastructure as a service (Iaas) [4] we can easily access the data at affordable costs. Cloud
computing basically uses three Deployment models namely Public cloud: Cloud sharing data of various
enterprises on single storage server; Private Cloud: A dedicated cloud storage server of a particular enterprise
and Hybrid Cloud: Combination of above two deployment models for easy and flexible use of information.
The bioinformatics big data opens up enormous possibilities for discovery, solutions, analysis and even
cures. Biological big data are generally challenging for their variety, value, and critical need for veracity. The
modern biological data covers diverse collection of omics field like genomics, proteomics, metabolomics,
metagenomics, lipidomics and glycomics. The omic data is generated from high throughput experimental
technologies. To store and analyze these data requires massive amounts of computational power of databases,
data formats, software packages and pipelines [5]. Today, the competition is on for the solutions to analyze and
store the data faster and easier and to bring cost down to the point where sequencing becomes easily available
for a much wider market. Penta bytes of raw information can be used to provide hints for everything from
preventing terabytes to shrinking healthcare costs – if we can figure out how to use the space efficiently.
This paper will be heading towards the constructive approach for saving bioinformatics big data using
cloud computing and also the challenges that would hinder bioinformatics bigdata to take a step towards cloud.
This paper is organized in the following sections:
In Section II, we give a background of big data in bioinformatics. We explore in more detail with the example of
bigdata in bioinformatics.
In Section III, we describe issues of bioinformatics big data.
In Section IV, we describe the ways in which cloud will give the solutions to challenges states in Section III.
Big Data In Bioinformatics & Era Of Cloud Computing
www.iosrjournals.org 54 | Page
Finally in section V, we will give the conclusion showing importance of the suggested solutions and their
extensions.
II. BIOINFORMATICS BIG DATA
As we move into the century, we stand at an inflection point in biology how we view and practice
biology has forever changed. Biology is becoming data intensive as high throughput experiments are generating
data to much greater pace as high throughput experiment technologies are becoming accessible to more number
of scientists. One of the inflection points is Human Genome Project (HGP) which is an international scientific
research project with a primary goal of determining the sequence of chemical base pairs which make up human
DNA, and of identifying and mapping the total genes of the human genome from both a physical and functional
standpoint. [6]
Human Genome Project provided a genetics parts list and catalyzed the development of high
throughput measurement tools (e.g. the high speed DNA sequences, DNA arrays, high throughput mass
spectrometry, etc) and high throughput measurement strategies (e.g. yeast two hybrid technique for measuring
protein protein interactions) as well as stimulating the development of powerful new computational tools for
acquiring, storing and analyzing biological information. The HGP has catalyzed the view that biology is an
informational science. Therefore, the huge amount of data generated by these high throughput experiments in
the genomic and proteomic arenas requires that databases are built in a way that facilitates not just the storage of
these data, but the efficient handling and retrieval of information from these huge databases. [7]
The use of DNA sequencing machines, which are smaller in size but capable of generating piles of data
faster and at a lower cost, have changed science and medicine in ways never seen before [8]. The current era is
beginning to look like the era of ―big data‖; a term refers to explosion of available information, which is
byproduct of the digital revolution [9]. However, with biomedical data accumulating in computers and servers
around the world [8], concerns over privacy and security of patient data are emerging.
Big data has affected several unrelated sectors in society including communications, media, medicine,
scientific research among others [9]. However, even though both the computers and the internet have become
faster, we have lack of computational infrastructure that is needed to securely generate, maintain, transfer and
analyze large scale information in life sciences to integrate omics data with other data sets, such as clinical data
from patients. Next Generation Sequencing (NGS) platforms that use semiconductors [10] or nanotechnology
[11] have exponentially increased the rate of biological data generation in the last three years.
III. ISSUES IN BIOINFORMATICS BIG DATA
Big Data generation and acquisition gives birth to profound challenges for storage, transfer and security
of the information. Even if companies were forced to limit their data collection and the storage space, still the
big data analytics would be needed. Big data storage space would be needed by companies to store their data
without any limits. Also, the computational time is needed to be decreased with the increase in the data for faster
processing and efficient results. Having large storage and computational infrastructure can be difficult to
maintain by the company moreover, implementing cost of this infrastructure may touch the sky.
Another challenge is to transfer the data from one location to another; it is mainly done either by the
use of external hard disks or by mail. Transfer and accessing of this data becomes time consuming leading to
decrease in processing time. Moreover, transfer of data may reduce work efficiency. Big data has to be
processed and computed simultaneously so that we can get faster outputs which can be shared and used from
any location the user wants to access. Transfer of big data may require large storage space transfer devices
which can be time consuming and may sometime lead to inefficient results due to unavailability of updated
results in the system.
Security and the privacy of data from individuals is also a concern. In every case the most important
issue of handling bioinformatics big data is security of the data whether it is in the storage database or while
transferring of data via external hard disks or email, security issue is the worry. Data has to be free from any
threats as well as data integrity and security has to be maintained. In this growing era of technologies,
authenticity and confidentiality of the data is also necessary for the safety of data.
IV. CLOUD AS SOLUTION TO BIOINFORMATICS BIG DATA
With the increased need to store data and information generated by big projects, an easy and flexible
computational solution such as cloud based computing has emerged. Cloud computing is the only storage model
that can provide the elastic scale needed for big data in the field of bioinformatics whose volume of output
generation is increasing day by day. With the promising technology of cloud nothing is impossible. The issues
that are stated above can be solved in one or the other ways described below.
1. Big Data Storage Space: With cloud computing one can use infrastructure that is data is now resided on the
outsourced servers. Use of cloud infrastructure which falls under Infrastructure as a Service (one of the
Big Data In Bioinformatics & Era Of Cloud Computing
www.iosrjournals.org 55 | Page
cloud services) [4]. As infrastructure as a service, service of cloud provides you all the necessary resources
like server, networking, storage space etc. to be used easily and efficiently, it has become very easy to store
the big data. The cost to keep your data on cloud infrastructure is very affordable as cloud uses ―pay as you
go‖ feature. It also provides rapid capacity elasticity and easily deliverable whenever and wherever you
want. Use of cloud infrastructure allows multi tenancy allowing you to share the data by multiple
companies in smart and secure manner. You can automate regular practices—such as provisioning, backup,
and replication—and integrate those practices as part of your cloud services, the better your environment
will scale. Using this multi tenancy and automation feature a group of people working on the same project
may simultaneously get the updated results provided by each other.
2. Data Transfer: Say no to all external hard drives to carry your data, as all your data is on cloud and you can
access your data from anywhere anytime or you can share your data by using multi tenant nature of the
cloud. Anyone who is authenticated and authorized to access that data will automatically get the updates
data which you wanted to transfer. Cloud has reduced the fear of unavailability of data due to hardware
failure during transfer to a great extent. Moreover, if you want to switch the data from one cloud service
provider to another it can be easily done without any difficulty. Proper service level agreements are signed
in order to ensure you the data security at cloud providers’ storage space.
3. Security and Privacy: The major issue to move or store your big data in cloud is its security. Security
solutions includes the use of better security systems with advanced encryption algorithms and proper
signing of Service level agreements may secure your data to a higher level.
As the data is residing on third party storage server, one is very much concerned about his data. In
order to trust that the cloud provider has kept your data safe on his premises one should use proper security
measures from his side also. Trust can be easily implemented by using the digital certificates [12] with proper
transport layer security, to completely rely on the cloud provider, as customers’ data is now within their
enterprise boundary as shown by Kharche H. et al. Security Model, Public Root CA and Enterprise CA can be
implemented to secure the big data efficiently and effectively. Making the model little more flexible and easy to
use by each and every individual according to their need the implementation flow has been re-drafted. The
flowchart given in Figure 1consists of three phases:
Phase 1: In phase one, according to the needs of the user one can use any encryption algorithm with
any key size fulfilling his requirement. Using any encryption algorithm as per user need makes the flow chart
more flexible as now the expertise of the encryption algorithm used is in the enterprise boundary.
Phase 2: In phase two, the company should create and use his own digital certificates in order to sign
the encrypted data resulted in phase 1. The private key generated during your own digital certificate generation
should be kept confidential as no one else will be able to open your data now.
Secure Your Data With Any Encryption
Algorithm with Any Key Size
Create Your Own Digital certificates &
Sign the Encrypted data
Use Transport Layer Security or Secure
Socket Layer Security
Figure 1: Flowchart to Implement Security and Trust
Big Data In Bioinformatics & Era Of Cloud Computing
www.iosrjournals.org 56 | Page
Phase 3: In phase three, the data is sent over the network in secured form. That is, one can use either of Secure
Socket Layer (SSL) or Transport Layer Security (TLS) to send the whole data over the network in the secured
format which makes the data triple secure.
Above suggested model is very easy and flexible to be used by any company with great ease with all the secrets
of the data encryption and the private key in the enterprise boundary. Even though the data is resided at the
cloud providers’ storage space, he doesn’t know about how data is secured at the enterprise level. This makes
the above model more secure and easy to use.
With cloud computing, one can easily overcome all the challenges which were hindering the focus of customers
to shift to cloud. All the challenges of storage space, transfer of data and security over cloud makes it very
affordable for an individual to move towards cloud. Infrastructure maintenance is done at cloud providers’ end,
which reduces the cost of hiring technical expertise and the maintenance of the high data storage servers.
V. CONCLUSION
Two clear advantages can be gained by using cloud to store bioinformatics bigdata. One, It let
companies to analyze massive datasets without capital investment in hardware and host the data internally on
the server. Two, hosted model tends to abstract the complexity, enabling more immediate deployment of big
data technology. One can easily use the cloud for the bioinformatics big data to large genomic sequences
efficiently with proper security, processing speed and affordable costs. Using the re-drafted flow of the model
individual can easily gain the trust on cloud. In this era of cloud computing one can easily store enormous
amount of data with the easily scalability of resources in cloud Big Data leads to big challenges leading to
proper arrangements to data storage in cloud effectively. Bioinformatics big data will need to implement
scalable structure to deal with volumes of data generated by omics technologies. Advances in informatics to
successfully address the big data challenges that to be faced in next decade should be adapted soon.
REFERENCES
[1] Wikipedia found at http://en.wikipedia.org/wiki/Bioinformatics
[2] Schatz MC, Langmead B, Salzberg SL. Cloud computing and the DNA data race. Nat Biotechnol.2010;28(7):691–693. doi:
10.1038/nbt0710-691.
[3] Armbrust M, Fox A, Griffith R, Joseph AD, Katz RH, Konwinski A, Lee G, Patterson DA, Rabkin A,Stoica I, Above the Clouds: A
Berkeley View of Cloud Computing. Berkeley: EECS Department, University of California; 2009.
[4] Kharche, Ms Heena, and Mr Deepak Singh Chouhan. "Building Trust In Cloud Using Public Key Infrastructure." International
Journal of Advanced Computer Science and Applications 3.3 (2012).
[5] Higdon et al ―Unraveling the complexities of life sciences data‖. Mary Ann Liebert Inc. Vol. 1, No. 1, Big Data.
[6] Wikipedia found at http://en.wikipedia.org/wiki/Human_Genome_Project
[7] Andreas D. Baxevanis, B.F. Francis Quellette Bioinformatics: A practical guide to the analysis of genes and proteins. (NJ: Wiley-
Interscience, 2005).
[8] Costa FF ,Big Data In Biomedicine. Drug Discovery Today. In press (2013)
[9] www.nytimes.com/2012/08/12/business/how-bigdat a- became-so-big-unboxed.html?r=0
[10] Rothberg J. M. et al. ―An integrated semiconductor device enabling non-optical genome sequencing‖: Nature 475(7356): 348–
352(2011)
[11] Clarke J. et al. ―Continuous base identification for single-molecule nanopore DNA sequencing‖: Nat Nanotechnol. (4): 265–70
(2009)
[12] Kharche, Ms Heena, and Mr Deepak Singh Chouhan. "Implementing Trust In Cloud Using Public Key Infrastructure." International
Journal of Engineering Inventions 1.5 (2012).

Weitere ähnliche Inhalte

Was ist angesagt?

Big data security and privacy issues in the
Big data security and privacy issues in theBig data security and privacy issues in the
Big data security and privacy issues in theIJNSA Journal
 
Big data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsBig data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsIJERA Editor
 
Mining Big Data using Genetic Algorithm
Mining Big Data using Genetic AlgorithmMining Big Data using Genetic Algorithm
Mining Big Data using Genetic AlgorithmIRJET Journal
 
Paper id 252014139
Paper id 252014139Paper id 252014139
Paper id 252014139IJRAT
 
Toward a real time framework in cloudlet-based architecture
Toward a real time framework in cloudlet-based architectureToward a real time framework in cloudlet-based architecture
Toward a real time framework in cloudlet-based architectureredpel dot com
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachAIRCC Publishing Corporation
 
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE csandit
 
Trust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEADTrust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEADBeth Plale
 
IRJET- A Novel Framework for Three Level Isolation in Cloud System based ...
IRJET-  	  A Novel Framework for Three Level Isolation in Cloud System based ...IRJET-  	  A Novel Framework for Three Level Isolation in Cloud System based ...
IRJET- A Novel Framework for Three Level Isolation in Cloud System based ...IRJET Journal
 
data mining with big data
data mining with big datadata mining with big data
data mining with big dataswathi78
 
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian networkImpact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian networkIJECEIAES
 
Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceBeth Plale
 
Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...redpel dot com
 
A scalabl e and cost effective framework for privacy preservation over big d...
A  scalabl e and cost effective framework for privacy preservation over big d...A  scalabl e and cost effective framework for privacy preservation over big d...
A scalabl e and cost effective framework for privacy preservation over big d...amna alhabib
 
Big data: Challenges, Practices and Technologies
Big data: Challenges, Practices and TechnologiesBig data: Challenges, Practices and Technologies
Big data: Challenges, Practices and TechnologiesNavneet Randhawa
 
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...ijcsit
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridIan Foster
 
Improving availability and reducing redundancy using deduplication of cloud s...
Improving availability and reducing redundancy using deduplication of cloud s...Improving availability and reducing redundancy using deduplication of cloud s...
Improving availability and reducing redundancy using deduplication of cloud s...dhanarajp
 

Was ist angesagt? (20)

Big data security and privacy issues in the
Big data security and privacy issues in theBig data security and privacy issues in the
Big data security and privacy issues in the
 
Big data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing PlatformsBig data Mining Using Very-Large-Scale Data Processing Platforms
Big data Mining Using Very-Large-Scale Data Processing Platforms
 
Dq36708711
Dq36708711Dq36708711
Dq36708711
 
Mining Big Data using Genetic Algorithm
Mining Big Data using Genetic AlgorithmMining Big Data using Genetic Algorithm
Mining Big Data using Genetic Algorithm
 
Paper id 252014139
Paper id 252014139Paper id 252014139
Paper id 252014139
 
Toward a real time framework in cloudlet-based architecture
Toward a real time framework in cloudlet-based architectureToward a real time framework in cloudlet-based architecture
Toward a real time framework in cloudlet-based architecture
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis Approach
 
LITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUD
LITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUDLITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUD
LITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUD
 
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
BIG DATA TECHNOLOGY ACCELERATE GENOMICS PRECISION MEDICINE
 
Trust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEADTrust threads : Active Curation and Publishing in SEAD
Trust threads : Active Curation and Publishing in SEAD
 
IRJET- A Novel Framework for Three Level Isolation in Cloud System based ...
IRJET-  	  A Novel Framework for Three Level Isolation in Cloud System based ...IRJET-  	  A Novel Framework for Three Level Isolation in Cloud System based ...
IRJET- A Novel Framework for Three Level Isolation in Cloud System based ...
 
data mining with big data
data mining with big datadata mining with big data
data mining with big data
 
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian networkImpact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
 
Trust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail ScienceTrust threads: Provenance for Data Reuse in Long Tail Science
Trust threads: Provenance for Data Reuse in Long Tail Science
 
Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...Frequency and similarity aware partitioning for cloud storage based on space ...
Frequency and similarity aware partitioning for cloud storage based on space ...
 
A scalabl e and cost effective framework for privacy preservation over big d...
A  scalabl e and cost effective framework for privacy preservation over big d...A  scalabl e and cost effective framework for privacy preservation over big d...
A scalabl e and cost effective framework for privacy preservation over big d...
 
Big data: Challenges, Practices and Technologies
Big data: Challenges, Practices and TechnologiesBig data: Challenges, Practices and Technologies
Big data: Challenges, Practices and Technologies
 
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
COMPLETE END-TO-END LOW COST SOLUTION TO A 3D SCANNING SYSTEM WITH INTEGRATED...
 
Aaas Data Intensive Science And Grid
Aaas Data Intensive Science And GridAaas Data Intensive Science And Grid
Aaas Data Intensive Science And Grid
 
Improving availability and reducing redundancy using deduplication of cloud s...
Improving availability and reducing redundancy using deduplication of cloud s...Improving availability and reducing redundancy using deduplication of cloud s...
Improving availability and reducing redundancy using deduplication of cloud s...
 

Andere mochten auch

Area Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorArea Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorIOSR Journals
 
To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...
To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...
To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...IOSR Journals
 
Segmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome imagesSegmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome imagesIOSR Journals
 
Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...
Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...
Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...IOSR Journals
 
Lexical and Parser tool for CBOOP program
Lexical and Parser tool for CBOOP programLexical and Parser tool for CBOOP program
Lexical and Parser tool for CBOOP programIOSR Journals
 
Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...
Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...
Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...IOSR Journals
 
Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...
Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...
Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...IOSR Journals
 
A Robust Approach for Detecting Data Leakage and Data Leaker in Organizations
A Robust Approach for Detecting Data Leakage and Data Leaker in OrganizationsA Robust Approach for Detecting Data Leakage and Data Leaker in Organizations
A Robust Approach for Detecting Data Leakage and Data Leaker in OrganizationsIOSR Journals
 
Distributed Intrusion Detection System for Wireless Sensor Networks
Distributed Intrusion Detection System for Wireless Sensor NetworksDistributed Intrusion Detection System for Wireless Sensor Networks
Distributed Intrusion Detection System for Wireless Sensor NetworksIOSR Journals
 
Approximating Source Accuracy Using Dublicate Records in Da-ta Integration
Approximating Source Accuracy Using Dublicate Records in Da-ta IntegrationApproximating Source Accuracy Using Dublicate Records in Da-ta Integration
Approximating Source Accuracy Using Dublicate Records in Da-ta IntegrationIOSR Journals
 
Concepts and Derivatives of Web Services
Concepts and Derivatives of Web ServicesConcepts and Derivatives of Web Services
Concepts and Derivatives of Web ServicesIOSR Journals
 
Structural, compositional and electrochemical properties of Aluminium-Silicon...
Structural, compositional and electrochemical properties of Aluminium-Silicon...Structural, compositional and electrochemical properties of Aluminium-Silicon...
Structural, compositional and electrochemical properties of Aluminium-Silicon...IOSR Journals
 
A new approach for user identification in web usage mining preprocessing
A new approach for user identification in web usage mining preprocessingA new approach for user identification in web usage mining preprocessing
A new approach for user identification in web usage mining preprocessingIOSR Journals
 
Discovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining ProcedureDiscovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining ProcedureIOSR Journals
 
Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...IOSR Journals
 
Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...
Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...
Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...IOSR Journals
 

Andere mochten auch (20)

Area Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips ProcessorArea Optimized Implementation For Mips Processor
Area Optimized Implementation For Mips Processor
 
To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...
To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...
To Reduce The Handover Delay In Wimax When The Mobile Station Moves At Higher...
 
Segmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome imagesSegmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome images
 
Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...
Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...
Chemical Composition, In Vitro Digestibility And Gas Production Characteristi...
 
E0212730
E0212730E0212730
E0212730
 
B01110814
B01110814B01110814
B01110814
 
Lexical and Parser tool for CBOOP program
Lexical and Parser tool for CBOOP programLexical and Parser tool for CBOOP program
Lexical and Parser tool for CBOOP program
 
A0420104
A0420104A0420104
A0420104
 
Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...
Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...
Design, Modelling& Simulation of Double Sided Linear Segmented Switched Reluc...
 
Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...
Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...
Matlab Based Analysis of 3-Ø Self-Excited Induction Generator with Nonlinear ...
 
A Robust Approach for Detecting Data Leakage and Data Leaker in Organizations
A Robust Approach for Detecting Data Leakage and Data Leaker in OrganizationsA Robust Approach for Detecting Data Leakage and Data Leaker in Organizations
A Robust Approach for Detecting Data Leakage and Data Leaker in Organizations
 
Distributed Intrusion Detection System for Wireless Sensor Networks
Distributed Intrusion Detection System for Wireless Sensor NetworksDistributed Intrusion Detection System for Wireless Sensor Networks
Distributed Intrusion Detection System for Wireless Sensor Networks
 
Approximating Source Accuracy Using Dublicate Records in Da-ta Integration
Approximating Source Accuracy Using Dublicate Records in Da-ta IntegrationApproximating Source Accuracy Using Dublicate Records in Da-ta Integration
Approximating Source Accuracy Using Dublicate Records in Da-ta Integration
 
Concepts and Derivatives of Web Services
Concepts and Derivatives of Web ServicesConcepts and Derivatives of Web Services
Concepts and Derivatives of Web Services
 
L01117074
L01117074L01117074
L01117074
 
Structural, compositional and electrochemical properties of Aluminium-Silicon...
Structural, compositional and electrochemical properties of Aluminium-Silicon...Structural, compositional and electrochemical properties of Aluminium-Silicon...
Structural, compositional and electrochemical properties of Aluminium-Silicon...
 
A new approach for user identification in web usage mining preprocessing
A new approach for user identification in web usage mining preprocessingA new approach for user identification in web usage mining preprocessing
A new approach for user identification in web usage mining preprocessing
 
Discovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining ProcedureDiscovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining Procedure
 
Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...Designing the Workflow of a Language Interpretation Device Using Artificial I...
Designing the Workflow of a Language Interpretation Device Using Artificial I...
 
Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...
Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...
Performance Analysis of Single Quantum Dots and Couple Quantum Dots at Interm...
 

Ähnlich wie Big Data in Bioinformatics & the Era of Cloud Computing

wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor networkparry prabhu
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Scienceijtsrd
 
Understand the Idea of Big Data and in Present Scenario
Understand the Idea of Big Data and in Present ScenarioUnderstand the Idea of Big Data and in Present Scenario
Understand the Idea of Big Data and in Present ScenarioAI Publications
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyanIAESIJEECS
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Onyebuchi nosiri
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Onyebuchi nosiri
 
Identifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big dataIdentifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big datasarfraznawaz
 
A Deep Dissertion Of Data Science Related Issues And Its Applications
A Deep Dissertion Of Data Science  Related Issues And Its ApplicationsA Deep Dissertion Of Data Science  Related Issues And Its Applications
A Deep Dissertion Of Data Science Related Issues And Its ApplicationsTracy Hill
 
A Comprehensive Study on Big Data Applications and Challenges
A Comprehensive Study on Big Data Applications and ChallengesA Comprehensive Study on Big Data Applications and Challenges
A Comprehensive Study on Big Data Applications and Challengesijcisjournal
 
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16,
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16,
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, ChereCheek752
 
A Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesA Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesEditor IJMTER
 
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docx
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docxSPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docx
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docxrosiecabaniss
 
Next generation big data analytics state of the art
Next generation big data analytics state of the artNext generation big data analytics state of the art
Next generation big data analytics state of the artNazrul Islam
 
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGSECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGIJNSA Journal
 
Security issues associated with big data in cloud computing
Security issues associated with big data in cloud computingSecurity issues associated with big data in cloud computing
Security issues associated with big data in cloud computingIJNSA Journal
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Thingspateelhs
 
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...IJERDJOURNAL
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data IJCERT JOURNAL
 
A Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremA Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremAnthonyOtuonye
 

Ähnlich wie Big Data in Bioinformatics & the Era of Cloud Computing (20)

wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor network
 
A Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data ScienceA Review Paper on Big Data and Hadoop for Data Science
A Review Paper on Big Data and Hadoop for Data Science
 
Understand the Idea of Big Data and in Present Scenario
Understand the Idea of Big Data and in Present ScenarioUnderstand the Idea of Big Data and in Present Scenario
Understand the Idea of Big Data and in Present Scenario
 
06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan06. 9534 14985-1-ed b edit dhyan
06. 9534 14985-1-ed b edit dhyan
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
 
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
Efficient Data Filtering Algorithm for Big Data Technology in Telecommunicati...
 
Identifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big dataIdentifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big data
 
A Deep Dissertion Of Data Science Related Issues And Its Applications
A Deep Dissertion Of Data Science  Related Issues And Its ApplicationsA Deep Dissertion Of Data Science  Related Issues And Its Applications
A Deep Dissertion Of Data Science Related Issues And Its Applications
 
A Comprehensive Study on Big Data Applications and Challenges
A Comprehensive Study on Big Data Applications and ChallengesA Comprehensive Study on Big Data Applications and Challenges
A Comprehensive Study on Big Data Applications and Challenges
 
Big data storage
Big data storageBig data storage
Big data storage
 
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16,
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16,
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16,
 
A Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesA Survey on Big Data Mining Challenges
A Survey on Big Data Mining Challenges
 
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docx
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docxSPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docx
SPECIAL SECTION ON HEALTHCARE BIG DATAReceived August 16, .docx
 
Next generation big data analytics state of the art
Next generation big data analytics state of the artNext generation big data analytics state of the art
Next generation big data analytics state of the art
 
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTINGSECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
SECURITY ISSUES ASSOCIATED WITH BIG DATA IN CLOUD COMPUTING
 
Security issues associated with big data in cloud computing
Security issues associated with big data in cloud computingSecurity issues associated with big data in cloud computing
Security issues associated with big data in cloud computing
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Things
 
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
An Investigation on Scalable and Efficient Privacy Preserving Challenges for ...
 
Challenges and outlook with Big Data
Challenges and outlook with Big Data Challenges and outlook with Big Data
Challenges and outlook with Big Data
 
A Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremA Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE Theorem
 

Mehr von IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Kürzlich hochgeladen

welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the weldingMuhammadUzairLiaqat
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsSachinPawar510423
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxsiddharthjain2303
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Solving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptSolving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptJasonTagapanGulla
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingBootNeck1
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catcherssdickerson1
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...asadnawaz62
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
Internet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxInternet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxVelmuruganTECE
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptMadan Karki
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxRomil Mishra
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - GuideGOPINATHS437943
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating SystemRashmi Bhat
 

Kürzlich hochgeladen (20)

welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the welding
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Vishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documentsVishratwadi & Ghorpadi Bridge Tender documents
Vishratwadi & Ghorpadi Bridge Tender documents
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptx
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Solving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.pptSolving The Right Triangles PowerPoint 2.ppt
Solving The Right Triangles PowerPoint 2.ppt
 
System Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event SchedulingSystem Simulation and Modelling with types and Event Scheduling
System Simulation and Modelling with types and Event Scheduling
 
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor CatchersTechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
TechTAC® CFD Report Summary: A Comparison of Two Types of Tubing Anchor Catchers
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...complete construction, environmental and economics information of biomass com...
complete construction, environmental and economics information of biomass com...
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
Internet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptxInternet of things -Arshdeep Bahga .pptx
Internet of things -Arshdeep Bahga .pptx
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.ppt
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptx
 
Transport layer issues and challenges - Guide
Transport layer issues and challenges - GuideTransport layer issues and challenges - Guide
Transport layer issues and challenges - Guide
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating System
 

Big Data in Bioinformatics & the Era of Cloud Computing

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 14, Issue 2 (Sep. - Oct. 2013), PP 53-56 www.iosrjournals.org www.iosrjournals.org 53 | Page Big Data in Bioinformatics & the Era of Cloud Computing Prakash Nemade1 , Heena Kharche2 1 (Department of Bioinformatics, Maulana Azad National Institute of Technology, Bhopal MP, India) 2 (Department of Computer Science & Engineering, IES-IPS Academy, Indore MP, India) Abstract: With the recent breakthrough in bioinformatics, demand of more storage space is increasing day by day. With this exponentially increasing data i.e. the big data of bioinformatics sector, the data needs to be handled in more flexible and cost effective manner. With this growth in the volume, variety and velocity of data, cloud computing promises to address big data issues and analysis of challenges of big data in bioinformatics. Big Data of bioinformatics consisting of various sequences (nucleotide or amino acid) which are now showing exponential growth because of high throughput experimental technologies. By the use of various bioinformatics tools demands for large and fast data storage technology is increasing. In this paper, security model is re- drafted for flexible use of model by consumers and various cloud based services and techniques are coined up to make an easy approach for implementing big data of bioinformatics using cloud. Keywords: Big Data, Bio Cloud, Bioinformatics, Cloud, Cloud Computing, Secure Cloud I. INTRODUCTION In general bioinformatics can be defined as ―An interdisciplinary field that develops and improves upon method for storing, retrieving, organizing and analyzing biological data‖ [1].With the emergence of highly efficient bioinformatics technologies, the volume, variety and velocity of data is increasing with a great pace. As reported on February 2012, two nanopore sequencing platforms (GridION and MinION) are capable of delivering ultra-long sequencing reads (~100kb) with additionally higher throughput and much lower cost [2]. With this amount of data, it would be increasingly daunting for small scale industries to invest in a large data server that will be kept in back room, to store your data. Moreover big servers demands for highly skilled people to maintain the data which increases their operating costs. At present, a promising solution to address this challenge is cloud computing, which exploits the full potential of multiple computers and delivers computation and storage as dynamically allocated virtual resources via the Internet [3]. Bigdata in bioinformatics face various implementation challenges which hinders customer to shift their data onto the cloud despite of several advantages. Cloud Computing is here to stay, as it is proposed to transform the way IT is deployed and managed, promising reduced implementation, maintenance costs and complexity, while accelerating innovation, providing faster time to market, and providing the ability to scale high-performance applications and infrastructures on demand [4]. Features like Broad network access, resource pooling, pay as you go, rapid elasticity and on demand access has made cloud computing king of computing. With the use of various services and concepts of cloud computing one can easily handle the problems of big data in bioinformatics. With the better use of the services i.e Software as a service (SaaS), Platform as a service (PaaS) and Infrastructure as a service (Iaas) [4] we can easily access the data at affordable costs. Cloud computing basically uses three Deployment models namely Public cloud: Cloud sharing data of various enterprises on single storage server; Private Cloud: A dedicated cloud storage server of a particular enterprise and Hybrid Cloud: Combination of above two deployment models for easy and flexible use of information. The bioinformatics big data opens up enormous possibilities for discovery, solutions, analysis and even cures. Biological big data are generally challenging for their variety, value, and critical need for veracity. The modern biological data covers diverse collection of omics field like genomics, proteomics, metabolomics, metagenomics, lipidomics and glycomics. The omic data is generated from high throughput experimental technologies. To store and analyze these data requires massive amounts of computational power of databases, data formats, software packages and pipelines [5]. Today, the competition is on for the solutions to analyze and store the data faster and easier and to bring cost down to the point where sequencing becomes easily available for a much wider market. Penta bytes of raw information can be used to provide hints for everything from preventing terabytes to shrinking healthcare costs – if we can figure out how to use the space efficiently. This paper will be heading towards the constructive approach for saving bioinformatics big data using cloud computing and also the challenges that would hinder bioinformatics bigdata to take a step towards cloud. This paper is organized in the following sections: In Section II, we give a background of big data in bioinformatics. We explore in more detail with the example of bigdata in bioinformatics. In Section III, we describe issues of bioinformatics big data. In Section IV, we describe the ways in which cloud will give the solutions to challenges states in Section III.
  • 2. Big Data In Bioinformatics & Era Of Cloud Computing www.iosrjournals.org 54 | Page Finally in section V, we will give the conclusion showing importance of the suggested solutions and their extensions. II. BIOINFORMATICS BIG DATA As we move into the century, we stand at an inflection point in biology how we view and practice biology has forever changed. Biology is becoming data intensive as high throughput experiments are generating data to much greater pace as high throughput experiment technologies are becoming accessible to more number of scientists. One of the inflection points is Human Genome Project (HGP) which is an international scientific research project with a primary goal of determining the sequence of chemical base pairs which make up human DNA, and of identifying and mapping the total genes of the human genome from both a physical and functional standpoint. [6] Human Genome Project provided a genetics parts list and catalyzed the development of high throughput measurement tools (e.g. the high speed DNA sequences, DNA arrays, high throughput mass spectrometry, etc) and high throughput measurement strategies (e.g. yeast two hybrid technique for measuring protein protein interactions) as well as stimulating the development of powerful new computational tools for acquiring, storing and analyzing biological information. The HGP has catalyzed the view that biology is an informational science. Therefore, the huge amount of data generated by these high throughput experiments in the genomic and proteomic arenas requires that databases are built in a way that facilitates not just the storage of these data, but the efficient handling and retrieval of information from these huge databases. [7] The use of DNA sequencing machines, which are smaller in size but capable of generating piles of data faster and at a lower cost, have changed science and medicine in ways never seen before [8]. The current era is beginning to look like the era of ―big data‖; a term refers to explosion of available information, which is byproduct of the digital revolution [9]. However, with biomedical data accumulating in computers and servers around the world [8], concerns over privacy and security of patient data are emerging. Big data has affected several unrelated sectors in society including communications, media, medicine, scientific research among others [9]. However, even though both the computers and the internet have become faster, we have lack of computational infrastructure that is needed to securely generate, maintain, transfer and analyze large scale information in life sciences to integrate omics data with other data sets, such as clinical data from patients. Next Generation Sequencing (NGS) platforms that use semiconductors [10] or nanotechnology [11] have exponentially increased the rate of biological data generation in the last three years. III. ISSUES IN BIOINFORMATICS BIG DATA Big Data generation and acquisition gives birth to profound challenges for storage, transfer and security of the information. Even if companies were forced to limit their data collection and the storage space, still the big data analytics would be needed. Big data storage space would be needed by companies to store their data without any limits. Also, the computational time is needed to be decreased with the increase in the data for faster processing and efficient results. Having large storage and computational infrastructure can be difficult to maintain by the company moreover, implementing cost of this infrastructure may touch the sky. Another challenge is to transfer the data from one location to another; it is mainly done either by the use of external hard disks or by mail. Transfer and accessing of this data becomes time consuming leading to decrease in processing time. Moreover, transfer of data may reduce work efficiency. Big data has to be processed and computed simultaneously so that we can get faster outputs which can be shared and used from any location the user wants to access. Transfer of big data may require large storage space transfer devices which can be time consuming and may sometime lead to inefficient results due to unavailability of updated results in the system. Security and the privacy of data from individuals is also a concern. In every case the most important issue of handling bioinformatics big data is security of the data whether it is in the storage database or while transferring of data via external hard disks or email, security issue is the worry. Data has to be free from any threats as well as data integrity and security has to be maintained. In this growing era of technologies, authenticity and confidentiality of the data is also necessary for the safety of data. IV. CLOUD AS SOLUTION TO BIOINFORMATICS BIG DATA With the increased need to store data and information generated by big projects, an easy and flexible computational solution such as cloud based computing has emerged. Cloud computing is the only storage model that can provide the elastic scale needed for big data in the field of bioinformatics whose volume of output generation is increasing day by day. With the promising technology of cloud nothing is impossible. The issues that are stated above can be solved in one or the other ways described below. 1. Big Data Storage Space: With cloud computing one can use infrastructure that is data is now resided on the outsourced servers. Use of cloud infrastructure which falls under Infrastructure as a Service (one of the
  • 3. Big Data In Bioinformatics & Era Of Cloud Computing www.iosrjournals.org 55 | Page cloud services) [4]. As infrastructure as a service, service of cloud provides you all the necessary resources like server, networking, storage space etc. to be used easily and efficiently, it has become very easy to store the big data. The cost to keep your data on cloud infrastructure is very affordable as cloud uses ―pay as you go‖ feature. It also provides rapid capacity elasticity and easily deliverable whenever and wherever you want. Use of cloud infrastructure allows multi tenancy allowing you to share the data by multiple companies in smart and secure manner. You can automate regular practices—such as provisioning, backup, and replication—and integrate those practices as part of your cloud services, the better your environment will scale. Using this multi tenancy and automation feature a group of people working on the same project may simultaneously get the updated results provided by each other. 2. Data Transfer: Say no to all external hard drives to carry your data, as all your data is on cloud and you can access your data from anywhere anytime or you can share your data by using multi tenant nature of the cloud. Anyone who is authenticated and authorized to access that data will automatically get the updates data which you wanted to transfer. Cloud has reduced the fear of unavailability of data due to hardware failure during transfer to a great extent. Moreover, if you want to switch the data from one cloud service provider to another it can be easily done without any difficulty. Proper service level agreements are signed in order to ensure you the data security at cloud providers’ storage space. 3. Security and Privacy: The major issue to move or store your big data in cloud is its security. Security solutions includes the use of better security systems with advanced encryption algorithms and proper signing of Service level agreements may secure your data to a higher level. As the data is residing on third party storage server, one is very much concerned about his data. In order to trust that the cloud provider has kept your data safe on his premises one should use proper security measures from his side also. Trust can be easily implemented by using the digital certificates [12] with proper transport layer security, to completely rely on the cloud provider, as customers’ data is now within their enterprise boundary as shown by Kharche H. et al. Security Model, Public Root CA and Enterprise CA can be implemented to secure the big data efficiently and effectively. Making the model little more flexible and easy to use by each and every individual according to their need the implementation flow has been re-drafted. The flowchart given in Figure 1consists of three phases: Phase 1: In phase one, according to the needs of the user one can use any encryption algorithm with any key size fulfilling his requirement. Using any encryption algorithm as per user need makes the flow chart more flexible as now the expertise of the encryption algorithm used is in the enterprise boundary. Phase 2: In phase two, the company should create and use his own digital certificates in order to sign the encrypted data resulted in phase 1. The private key generated during your own digital certificate generation should be kept confidential as no one else will be able to open your data now. Secure Your Data With Any Encryption Algorithm with Any Key Size Create Your Own Digital certificates & Sign the Encrypted data Use Transport Layer Security or Secure Socket Layer Security Figure 1: Flowchart to Implement Security and Trust
  • 4. Big Data In Bioinformatics & Era Of Cloud Computing www.iosrjournals.org 56 | Page Phase 3: In phase three, the data is sent over the network in secured form. That is, one can use either of Secure Socket Layer (SSL) or Transport Layer Security (TLS) to send the whole data over the network in the secured format which makes the data triple secure. Above suggested model is very easy and flexible to be used by any company with great ease with all the secrets of the data encryption and the private key in the enterprise boundary. Even though the data is resided at the cloud providers’ storage space, he doesn’t know about how data is secured at the enterprise level. This makes the above model more secure and easy to use. With cloud computing, one can easily overcome all the challenges which were hindering the focus of customers to shift to cloud. All the challenges of storage space, transfer of data and security over cloud makes it very affordable for an individual to move towards cloud. Infrastructure maintenance is done at cloud providers’ end, which reduces the cost of hiring technical expertise and the maintenance of the high data storage servers. V. CONCLUSION Two clear advantages can be gained by using cloud to store bioinformatics bigdata. One, It let companies to analyze massive datasets without capital investment in hardware and host the data internally on the server. Two, hosted model tends to abstract the complexity, enabling more immediate deployment of big data technology. One can easily use the cloud for the bioinformatics big data to large genomic sequences efficiently with proper security, processing speed and affordable costs. Using the re-drafted flow of the model individual can easily gain the trust on cloud. In this era of cloud computing one can easily store enormous amount of data with the easily scalability of resources in cloud Big Data leads to big challenges leading to proper arrangements to data storage in cloud effectively. Bioinformatics big data will need to implement scalable structure to deal with volumes of data generated by omics technologies. Advances in informatics to successfully address the big data challenges that to be faced in next decade should be adapted soon. REFERENCES [1] Wikipedia found at http://en.wikipedia.org/wiki/Bioinformatics [2] Schatz MC, Langmead B, Salzberg SL. Cloud computing and the DNA data race. Nat Biotechnol.2010;28(7):691–693. doi: 10.1038/nbt0710-691. [3] Armbrust M, Fox A, Griffith R, Joseph AD, Katz RH, Konwinski A, Lee G, Patterson DA, Rabkin A,Stoica I, Above the Clouds: A Berkeley View of Cloud Computing. Berkeley: EECS Department, University of California; 2009. [4] Kharche, Ms Heena, and Mr Deepak Singh Chouhan. "Building Trust In Cloud Using Public Key Infrastructure." International Journal of Advanced Computer Science and Applications 3.3 (2012). [5] Higdon et al ―Unraveling the complexities of life sciences data‖. Mary Ann Liebert Inc. Vol. 1, No. 1, Big Data. [6] Wikipedia found at http://en.wikipedia.org/wiki/Human_Genome_Project [7] Andreas D. Baxevanis, B.F. Francis Quellette Bioinformatics: A practical guide to the analysis of genes and proteins. (NJ: Wiley- Interscience, 2005). [8] Costa FF ,Big Data In Biomedicine. Drug Discovery Today. In press (2013) [9] www.nytimes.com/2012/08/12/business/how-bigdat a- became-so-big-unboxed.html?r=0 [10] Rothberg J. M. et al. ―An integrated semiconductor device enabling non-optical genome sequencing‖: Nature 475(7356): 348– 352(2011) [11] Clarke J. et al. ―Continuous base identification for single-molecule nanopore DNA sequencing‖: Nat Nanotechnol. (4): 265–70 (2009) [12] Kharche, Ms Heena, and Mr Deepak Singh Chouhan. "Implementing Trust In Cloud Using Public Key Infrastructure." International Journal of Engineering Inventions 1.5 (2012).