Introduction to NRSP databases
Download
Report
Transcript Introduction to NRSP databases
Introduction to NRSP databases
and other breeding databases
Topic
Why do we need integrated databases?
Data/Tools in NRSP databases
Demo of NRSP databases
Accessing genomic and genetic data
Accessing Breeding data
Introduction to other breeding databases
Future development
Integrated Databases - To Facilitate Discovery
Genomics
Basic Science
Structure and
evolution of
genome, gene
function, genetic
variability,
mechanism
underlying traits
Diversity
Genetics
Integrate
d Data &
Tools
Germplasm
Breeding
Translational
Science
QTL /marker
discovery,
genetic mapping,
Breeding values
Applied Science
Management of breeding data
Utilization of DNA information in breeding decisions
Data and Tools in NRSP databases
•
•
•
•
Genomics Data
• Whole genome sequences and annotation
• Transcriptome
• Gene database (genes with gene symbols)
• Synteny data
• Pathway data
Genetics Data
• Markers and maps
• QTL/Molecular diversity
Breeding Data
• Genotypic data
• Phenotypic data
• Germplasm data
Tools
• Genome browser/CMap (comparative map viewer)
• Cyc (pathway viewer)
• Blast
• Breeding Decision Tools
Accessing Genomic and Genetic data
in NRSP databases
Species Pages
Sequence Search
Sequence Search Results
Search Details Page
CottonGen JBrowse
Marker Search (Citrus)
Trait Loci (QTL/MTL) Search
Database and Tools for Breeding
Data in GDR
Data
Private data from WA apple breeding program
Private and Public breeding data from RosBreed
project
(apple, strawberry, peach, sweet cherry, tart cherry)
Interface
Data Management (Browse, Search and Download)
Data Conversion (Generate Input files for Pedimap)
Decision Support
Cross Assist
Data Management
Search/Download Phenotypic Data
Search/Download Genotypic Data
Creating Website for Each Group
Phenotypic Data Search
18
19
Variety Detail Page
Genotypic Data Search
21
Homepage of a private breeding
program
Data Conversion
Generate Input files for Pedimap, a tool for exploring
and visualizing the flow of phenotypes and alleles
through pedigrees
Choose Pedigree
Choose Trait and Modify Values
Generate a File and View in Pedimap
Decision Support
Cross Assist
What is Cross Assist?
A web interface to generate a list of parents and
the number of seedlings to get the progeny with
desired traits
Methods
“Phenotype” (uses only phenotypic
information of individuals in the dataset),
“+Pedigree” (uses both phenotypic and
pedigree information)
“+Ped+DNA” (uses phenotypic, pedigree
information and information provided by
DNA-based functional genotypes).
Step 1: Select Method
Step 2: Select target number and trait
thresholds
Step 3: Filter results by data completeness,
required number of seedlings, and parentage
Other breeding database/breeding
management system (BMS)
BMS from Integrated Breeding Platform and
CassavaBase
Nonproprietary
Both are compatible with Field Book App
Breeding data in Chado (open source database
schema)
BMS from IBP
CassavaBase
Runs in PC
Online Database
Compatible with Field Book App
Compatible with Field Book App
Data is stored partially in Chado
Data is stored entirely in Chado
Breeder oriented
Breeder oriented
More extensive statistical tools
NRSP databases
(Tripal BMS)
Compatible with Field Book App
Data stored entirely in Chado
Online database
Comprehensive genomic and genetic database with
a Breeding Management System
Built in Tripal : open-source that can be adopted by
any other crop databases
Features of IBP and CassavaBase
Data management (IBP and CassavaBase)
Data import (From Field Book App and templates)
Managing germplasm lists, crosses, nurseries/trials and making field
map
Query/Browse
Statistical tools (IBP)
Single site/Multi site analysis (GxE) for phenotypic data
QTL analysis
Marker-Assisted Breeding tools
Molecular Breeding design tools (IBP)
OptiMAS (IBP)
Genomic Selection (CassavaBase)
Data Management Tools in BMS (IBP)
Data Management Tools
(CassavaBase)
Create Lists
Create
Field Book File
Download File
Load to Field Book
Statistical Tools in BMS (IBP)
Single site analysis:
produces adjusted means per genotype, as well as summary statistics to
describe the data.
View box plot for a trait
and run the analysis again if
a valued need to be excluded
File of predicted
means
Choose a dataset
Future Development in NRSP
databases (Tripal BMS)
Data Management
Interface for data import both from templates and Field Book App
Web interface to design/store new crosses/field maps and trials
More query interfaces
More data conversion tools
Decision Tools
Further development of the current tools
New tools (Statistical tools, QTL discovery tool)
Acknowledgements
Mainlab Bioinformatics Team
Sook Jung, Taein Lee, Chun-Huai Chen, Stephen Ficklin, Jing Yu, Jodi
Humann, Ping Zheng, Anna Blenda, Sushan Ru, Yu Ma, Dorrie Main
Project coPIs/Pis and collaborators
tfGDR (GDR and Citrus); Cacao Genome Database; Pine Genome
Sequencing Project; Genome Database for Vaccinium; Cool Season
Food Legume Database; CottonGen; RosBreed
Kate Evans (Breeders Toolbox for GDR), Cameron Peace (Cross Assist)
Rosaceae, Citrus, Cacao, Blueberry, Legume, Cotton research
communities and Bioinformatics Communities
USDA NIFA SCRI, USDA DOE, NSF Plant Genome Program, USDAARS, SAAEDS, Mars Inc, Washington Tree Fruit Research
Commission, Cotton Incorporated, USA Dry Pea and Lentil
Commission, Northern Pulse Growers, Citrus Research Commission
US Land Grant University researchers and extension agents
The Team