CP4071 Bio Informatics Syllabus:

CP4071 Bio Informatics Syllabus – Anna University PG Syllabus Regulation 2021

COURSE OBJECTIVES:

 Exposed to the need for Bioinformatics technologies
 Be familiar with the modeling techniques
 Learn microarray analysis
 Exposed to Pattern Matching and Visualization
 To know about Microarray Analysis

UNIT I INTRODUCTION

Need for Bioinformatics technologies – Overview of Bioinformatics technologies Structural bioinformatics – Data format and processing – Secondary resources and applications – Role of Structural bioinformatics – Biological Data Integration System.

UNIT II DATAWAREHOUSING AND DATAMINING IN BIOINFORMATICS

Bioinformatics data – Data warehousing architecture – data quality – Biomedical data analysis – DNA data analysis – Protein data analysis – Machine learning – Neural network architecture and applications in bioinformatics.

UNIT III MODELING FOR BIOINFORMATICS

Hidden Markov modeling for biological data analysis – Sequence identification – Sequence classification – multiple alignment generation – Comparative modeling –Protein modeling – genomic modeling – Probabilistic modeling – Bayesian networks – Boolean networks – Molecular modeling – Computer programs for molecular modeling.

UNIT IV PATTERN MATCHING AND VISUALIZATION

Gene regulation – motif recognition – motif detection – strategies for motif detection – Visualization – Fractal analysis – DNA walk models – one dimension – two dimension – higher dimension – Game representation of biological sequences – DNA, Protein, Amino acid sequences.

UNIT V MICROARRAY ANALYSIS

Microarray technology for genome expression study – image analysis for data extraction – preprocessing – segmentation – gridding – spot extraction – normalization, filtering – cluster analysis – gene network analysis – Compared Evaluation of Scientific Data Management Systems – Cost Matrix – Evaluation model – Benchmark – Tradeoffs.

TOTAL: 45 PERIODS

LIST OF EXPERIMENTS:

1. Manipulating DNA strings
2. Use Protein Data Bank to visualize and Analyze the Proteins from protein database
3. Explore the Human Genome with the SciPy Stack
4. Hidden Markov Model for Biological Sequence
5. Molecular Modeling using MMTK package
6. Sequence Alignment using Biopython, Pairwise and multiple sequence alignment using ClustalW and BLAST
7. Simple generation and manipulation of genome graphs
8. DNA data handling using Biopython
9. Chaos Game Representation of a genetic sequence
10. Visualize the microarray data using Heatmap

TOTAL: 30 PERIODS

COURSE OUTCOMES:

CO1: Understand the different Data formats
CO2: Develop machine learning algorithms.
CO3: Develop models for biological data.
CO4: Apply pattern matching techniques to bioinformatics data – protein data genomic data.
CO5: Apply micro array technology for genomic expression study.

TOTAL: 45 +30=75 PERIODS

REFERENCES

1. Yi-Ping Phoebe Chen (Ed), “Bio Informatics Technologies”, First Indian Reprint, Springer Verlag, 2007.
2. Bryan Bergeron, “Bio Informatics Computing”, Second Edition, Pearson Education, 2015.
3. Arthur M Lesk, “Introduction to Bioinformatics”, Second Edition, Oxford University Press, 2019