Modeling Functional Genomics Datasets CVM8890-101 Lesson 1 13

Modeling Functional Genomics Datasets CVM8890-101 Lesson 1 13

Modeling Functional Genomics Datasets CVM8890-101 Lesson 1 13 June 2007 Bindu Nanduri

Lesson 1: Data to Biological sense. What we are trying to achieve. Introduction to functional genomics modeling strategies.

Transcriptomics and Proteomics Why study gene expression changes?????

Transcription is predominant form of regulation Northern Blots Mol Vis.

1996 Nov 4;2:11 Microarrays Basic concept: Reverse Northern blot on a large scale

High throughput: hybridize control and experimental samples simultaneously using distinct fluorescent dyes many assays can be carried out in parallel Affymetrix oligo arrays design

Usually the most 3 prime area, often UTR AAAA.. 25mer 25mer

25mer (11 to 16) 25mer Genomic Tiling Array Design Genome Sequence

5 3 Multiple probes

Center-Center Resolution 38 bp ISB Systems Biology Course 2006 Is mRNA level = Protein level?

Is there a correlation??? Comparison of protein levels (MS, 2D gels) and RNA levels (SAGE) for 156 genes in yeast mRNA levels unchanged, but protein levels varied by up to 20X protein levels unchanged, but mRNA levels varied by up to 30X Highly expressed mRNAs correlate well with protein levels

Gygi et al. (1999) ISB Systems ISB Systems Biology

Course 2006 Expressed Sequence Tags ESTspieces of DNA sequence (usually 200 to 500 nt) generated by sequencing either one or both ends of an expressed gene Bits of DNA that represent genes expressed in certain cells, tissues,

or organs from different organisms and Can be useful "tags" to fish a gene out of a portion of chromosomal DNA by matching base pairs

EST Sequence Clustering Gene can be expressed as mRNA many,many times, ESTs derived from this mRNA may be redundant many identical, or similar, copies of the same EST redundancy and overlap means that when someone searches dbEST for a particular EST, they may retrieve a long list

of tags, many of which may represent the same gene UniGene database automatically partitions GenBank sequences into a non-redundant set of gene-oriented clusters

ESTs: EST mapping to the genome, annotation differential expression Transcriptome: Clustering, differential expression analysis Proteome: differential expression analysis

Multiple data analysis platforms Proteomics Trans criptomics

ES T analysis LIST of elements

Modeling Function Modeling function requires: knowing the components of the system (structural annotation) knowing what these components do & how they interact

(functional annotation) Where do you begin???? Specifics

Transcriptome Analysis Clustering Similar expression patterns = similar regulation?

clustering algorithms help us identify patterns in plex data Key Goal: identify co-regulated groups of genes

Hierarchical clustering K-means clustering Self organizing feature maps Principal component analysis Proteomics

Qualitative : total number of identified proteins data intersections Quantitative: changes in protein expression Proteomic data analysis tools

Use GO for. Grouping gene products by biological function Determining which classes of gene products are overrepresented or under-represented Focusing on particular biological pathways and functions (hypothesis-driven data interrogation)

Relating a proteins location to its function Course Overview Introduction to functional annotation. Orthologs and homologs; clusters of orthologous genes (COGs) and the gene ontology (GO); and how to find what functional annotation is available

Tools for functional annotation. Accessing functional data; computational strategies to obtain more complete functional annotation; the AgBase GO annotation pipeline. Introduction to pathways analysis. Theory and strategies for pathway analysis modeling in different species and tools for pathway analysis.

Functional genomics modeling : prokaryotic and eukaryotic examples Some Useful Links (comprehensive access to information regarding complete and ongoing genome projects around the world.) (provides a controlled vocabulary to describe gene and gene product attributes in any organism) (integrated protein informatics resource for genomics and proteomics) (protein database) (maintains a set of generic databases as well as the

systematic comparative analysis of microbial, fungal, and plant genomes.) (comprehensive resource for public databases, literature and tools) (System that maintains automatic annotation of large eukaryotic genomes) (expert protein analysis system) (BioCyc is a collection of 260 Pathway/Genome Databases: metabolic pathways) (biological systems" database integrating both molecular building block information and higher-level systemic information) Some Useful Links (functional genomics studies on a variety of pathogens for which genomic sequence information is currently, or will soon be, available) (comprehensive resource for microbial genomics) (High throughput proteome annotations) (Arabidopsis resources) (systems biology portal) (mathematical models of biological interests) (species-specific

collections of genes and annotation) (Microarray analysis resources) (Database for Annotation, Visualization and Integrated Discovery) (swine genetics community)

Some Useful Links (pathways and tools for analysis) (database of human genes that includes automatically-mined genomic, proteomic and transcriptomic

information, as well as orthologies, disease relationships, SNPs, gene expression, gene function, and service links for ordering assays and antibodies) (proteomics tools) (open access institute) (A network of genes and proteins extends through the scientific literature) (comparative analysis of protein sequence) (genome-scale algorithm for grouping ortholog protein sequences) (ortholog prediction program) (transcription factor database) Some Useful Links (curated knowledgebase of biological

pathways) Virtual Library of Biochemistry,Moleculer Biology and Cell Biology) (Stanford genomic resources) (collection of tools for annotation and analysis of sequences) (prediction of transmembrane domains in proteins) (subcellular localization predictions) (prediction of membrane-spanning regions and their orientation) (functional analysis of

agricultural plant and animal gene products)

Recently Viewed Presentations

  • Verbs


    Action Verb tell what the subject is doing. ... -Action verbs can be actions of the body or mind. Examples of action verbs: talk laugh think jump. Action Verbs Linking verbs act as an equal sign ( ) for a...
  • Cell Cycle, Mitosis, and Meiosis

    Cell Cycle, Mitosis, and Meiosis

    Cell Cycle, Mitosis, and Meiosis Why do cells divide? Most cells go through a series of changes in order to maintain homeostasis. Cells need to reproduce (divide) when their surface area can no longer supply the much larger volume with...
  • Chapter 15: Reconstructing a Nation, 1865-1877 Presentation

    Chapter 15: Reconstructing a Nation, 1865-1877 Presentation

    How did both Indian and French fur traders—in terms of appearance, lifestyle, language, etc.—come to embody the middle ground? ... John White's Watercolor of an Algonquian Village. Service Historique de la Marine, Vincennes, France/Bridgeman Images.
  • Nicole Mihai, OCE, Program Manager

    Nicole Mihai, OCE, Program Manager

    Ontario Centres of Excellence (OCE) Canadian Manufacturers and Exporters (CME) ... Have a vision to be market leaders in their chosen sector. Project eligibility. ... Projects must fall within technology readiness levels TRL1-4 . Project eligibility. page . Application Process.
  • CSP North East Regional Network -

    CSP North East Regional Network -

    Draft LBP Guidelines - Diarmaid Ferguson, Clinical Specialist in Rehabilitation, Northumbria Healthcare NHS Trust. Talipes- Changes "afoot" - Dr Amanda Trees - ESP - South Tees NHS Foundation Trust "Post ICU Rehab - is it survival of the fittest? "...
  • Adjectives, Accusatives and Word Order

    Adjectives, Accusatives and Word Order

    magnum . parvus, small, little. parva, parvum. amant. they love, like. portant. they carry. Finding Direct Objects. A direct object follows an ACTION verb (not a linking verb like "is" and "are"). The . direct object . is the noun...
  • Lecture 3 John Woodward Copying Numbers int luckyNumber

    Lecture 3 John Woodward Copying Numbers int luckyNumber

    Author: John R Woodward Created Date: 09/08/2013 02:40:39 Title: PowerPoint Presentation Last modified by: John R Woodward
  • Sunflower Project Change Agent Network Meeting #8 January

    Sunflower Project Change Agent Network Meeting #8 January

    2. Load in DA-118 encs. 3. Clear suspended tranactions. Last day for central activity for prior FY transactions in STARS. SOKI. Agency Systems. 1. Begin manual entry of items 2. Convert supplier contracts 3. Convert Bidders 4. Extract data (for...