DESIGN AND DEVELOPMENT OF BIOINFORMATICS FEATURE BASED DNA SEQUENCE DATA COMPRESSION ALGORITHM

Design and Development of Bioinformatics Feature Based DNA Sequence Data Compression Algorithm

Design and Development of Bioinformatics Feature Based DNA Sequence Data Compression Algorithm

Blog Article

INTRODUCTION: Genetic data plays a key role in the healthcare area in specific, but they are typically very large in size.Many research shows that absence of DNA information at the right time is one of the major causes of error in thehealthcare area.The more genomics information that analysts secure, the better the prospects for individual and generalwellbeing.Persevering and retrieving genetic information in the right form within the given time is a big challenge in thefield of Healthcare.

Effectively, pre-birth DNA tests screen for formative variations from the norm.Before long, patientswill have their blood sequenced to detect any nonhuman DNA that may flag an irresistible illness.Later on, somebodymanaging malignancy will most likely track the movement of the sickness by having the DNA and RNA of single cellsfrom various tissues sequenced every day.DNA sequencing of whole population will give a complete and better predictionof population wellbeing.

OBJECTIVES: Hereditary data is growing exponentially; hence it is hard to deal with the consistently developinghereditary database.The human genome in its base configuration occupies almost thirty terabyte of storage space.Computational assets are constrained.Not just storage, transmission abilities and run time memory is likewise constrained.

Data Compression is a test when the hereditary information is exponentially expanding.It is critical to save the Chlorella integrity ofhereditary information while packing it.Hence the main objective of this paper is to develop a lossless DNA compressionalgorithm that not only gives better compression but also help in retrieval of Information for efficient use in the area ofHealthcare.METHODS: In this paper a lossless hereditary data compression method is being proposed.

The proposed calculationworks in a horizontal mode and utilization a reference based substitution technique for compression.The principle thoughtof this paper is in the kind of similarity scanned.All the predominant hereditary Compression methods search forsimilarity within the chromosome.These calculations either pursue flat mode or vertical mode for accomplishingcompression.

But whichever method the existing genetic compression algorithms use, they are all based on searchingsimilarities within the chromosome i.e.they exploit only inter chromosomal similarities.The current studies focus willshow that compression ratio achieved by analyzing individual chromosome is always less than the method in which weanalyze and compress intra chromosomal similarities.

RESULTS: This study shows that by simply using exactly matching repeats amongst all the chromosomes of the samegenome, not only the compression ratio is improving but also a detailed study of all the similarities and differencesbetween two genomes of the same species can be conducted.CONCLUSION: In this study, a new compression algorithm is being proposed for compressing DNA.Along with Interchromosomal similarities, Intra chromosomal similarities are considered for this method.The results clearly shows thatintra chromosomal matches are bigger and more than inter chromosomal matches which helps us to achieve Cinema Lens bettercompression ratio.

Report this page