September 8th - 9th, 2020
David Waring: djw64@cornell.edu
T3/Breedbase Production Databases:
T3/Breedbase Sandbox Databases:
Phenotype Upload Instructions:
Germplasm Search Tool:
Video Recordings:
Location Template
Accession Template
Trial Template (trial information and plot layout)
Phenotype Template (trait observations)
T3 is The Triticeae Toolbox:
Each crop has a separate production and sandbox database.
Two ways to submit data:
Detailed upload instructions are available on the sandbox database for each crop:
Below is an excerpt from and a link to the demo trial data I’ll be adding to the database:
Plot | Line | Row | Column | Rep | Block | grain yield (kg/ha) | test weight (g/L) | FHB Incidence (%) | FHB Severity (%) | DON Level (ppm) |
---|---|---|---|---|---|---|---|---|---|---|
101 | VA14W-99 | 1 | 1 | 1 | 1 | 1984.58 | 739.01 | 53.15 | 8.91 | 1.13 |
102 | NC14-23372 | 1 | 2 | 1 | 1 | 2044.19 | 756.15 | 35.59 | 8.08 | 1.75 |
103 | VA13W-38 | 1 | 3 | 1 | 1 | 2356.95 | 787.56 | 36.02 | 20.78 | 1.52 |
104 | NC15-25672 | 1 | 4 | 1 | 1 | 2061.00 | 752.54 | 69.92 | 20.64 | 2.25 |
105 | NC14-23373 | 1 | 5 | 1 | 1 | 1535.94 | 769.60 | 78.59 | 9.40 | 1.38 |
The demo data contains observations from two locations: Ithaca, NY and Geneva, NY. There are ten lines sampled in three reps at each location with five traits observed: grain yield, test weight, FHB incidence, FHB severity, and DON level.
Each crop has a separate production and sandbox database. These are separate websites and copies of the database and as such have separate account management systems. You’ll need to create an account on each database you want access to.
You can create an account by clicking the New User button in the top-right of the toolbar. After filling out the registration form you should receive an account confirmation email from noreply@graingenes.org
(check your Junk/Spam folder if you don’t see it in your Inbox) with a link to confirm your account. Once your account has been confirmed you can log in to the website.
Accounts created on a sandbox database will be automatically granted ‘submitter’ privileges. Accounts created on a production database will have read-only access.
Use Case: You have trials that were located in two different towns. Are these locations already in the database? How do you add a new location?
You can view the locations that are already in the database by going to the Manage > Locations page. Here you’ll see a table and a map of all existing locations to search for your trial locations.
If the location of your trial is not already in the database, there are two ways to add new locations from the Manage > Locations page:
A location template includes the following information:
Name | Abbreviation | Country Code | Country Name | Program | Type | Latitude | Longitude | Altitude | NOAA Station ID |
---|---|---|---|---|---|---|---|---|---|
Geneva, NY | GENNY | USA | United States of America | Cornell University | Farm | 42.880173 | -77.009083 | 187 | GHCND:USC00303177 |
If you’re using R to extract your data to generate the breedbase templates, you can use the breedbase R package to query an API to get the lat, lon, altitude and NOAA Station ID of locations based on street addresses.
Use Case: You have a list of lines that were phenotyped in your trials. Are these lines already in the database? How do you update existing and add new lines?
The Germplasm Search Tool is a separate tool that queries the breedbase database for germplasm lines and matches your line names up with existing database entries using a number of search methods. It is available under the Search menu as Bulk Accession Search. Links for each crop are:
The Germplasm Search Tool can be used to check the experimental list of lines for existing matches in the database and identify lines that:
The ten lines in the demo data are named:
BESS
ERNIE
LES15-5499
LES15-5605
NC14-23372
NC14-23373
NC15-25672
VA13W-174
VA13W-38
VA14W-99
There are two lines that will need to be added as Accessions in the database:
NC15-25672
VA14W-99
The Accession upload template contains properties about the germplasm lines being added to the database. If an entry exists with the same name in the database, the properties will be updated with the ones in the template. At a minimum, the accesion_name and species_name properties are required.
An Accession template can include the following information:
accession_name | species_name | population_name | organization_name(s) | synonym(s) | variety(s) | country_of_origin(s) | notes(s) | accession_number(s) | purdy_pedigree | filial_generation |
---|---|---|---|---|---|---|---|---|---|---|
NC15-25672 | Triticum aestivum | North Carolina State University | NC15_25672 | PI 1234 | NC15-1/NC15-2 | 9 | ||||
VA14W-99 | Triticum aestivum | Virginia Tech | VA14W 99 | CItr 5678 | VA14W-1/VA14W-2 | 9 |
To upload an Accession template:
Once your upload has been processed, you can find your new Accession(s) by going to the Search > Accessions and Plots page and entering the name of one of your Accessions.
Click the name of your Accession in the results table to open the Accession detail page. Here the Additional Info section should include the property values from your upload template.
Use Case: You have one or more trials that were previously designed. You know the plot layout of the trials. How do you add the trials to the database?
There are multiple methods of adding a trial to the database. For this example, we’ll be adding an existing trial by uploading a trial template file (not using the trial design tool). There are two trial template types:
The following properties relate to the trial:
The following properties relate to a plot:
1
for a control/check and 0
(or blank) for an experimental lineThe trial upload template combines the trial properties (which are repeated for each plot) and the plot properties.
trial_name | breeding_program | location | year | design_type | description | trial_type | plot_width | plot_length | field_size | planting_date | harvest_date | plot_name | accession_name | plot_number | block_number | is_a_control | rep_number | range_number | row_number | col_number | seedlot_name | num_seed_per_plot | weight_gram_seed_per_plot |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AYT_2019_Ithaca | Cornell University | Ithaca, NY | 2019 | RCBD | 2019 Advanced Yield Trial @ Ithaca | Advanced Yield Trial | 1 | 1 | 2019-06-15 | 2019-10-11 | AYT_2019_Ithaca-PLOT101 | VA14W-99 | 101 | 1 | 1 | 1 | 1 | ||||||
AYT_2019_Ithaca | Cornell University | Ithaca, NY | 2019 | RCBD | 2019 Advanced Yield Trial @ Ithaca | Advanced Yield Trial | 1 | 1 | 2019-06-15 | 2019-10-11 | AYT_2019_Ithaca-PLOT102 | NC14-23372 | 102 | 1 | 1 | 1 | 2 | ||||||
AYT_2019_Ithaca | Cornell University | Ithaca, NY | 2019 | RCBD | 2019 Advanced Yield Trial @ Ithaca | Advanced Yield Trial | 1 | 1 | 2019-06-15 | 2019-10-11 | AYT_2019_Ithaca-PLOT103 | VA13W-38 | 103 | 1 | 1 | 1 | 3 | ||||||
AYT_2019_Ithaca | Cornell University | Ithaca, NY | 2019 | RCBD | 2019 Advanced Yield Trial @ Ithaca | Advanced Yield Trial | 1 | 1 | 2019-06-15 | 2019-10-11 | AYT_2019_Ithaca-PLOT104 | NC15-25672 | 104 | 1 | 1 | 1 | 4 | ||||||
AYT_2019_Ithaca | Cornell University | Ithaca, NY | 2019 | RCBD | 2019 Advanced Yield Trial @ Ithaca | Advanced Yield Trial | 1 | 1 | 2019-06-15 | 2019-10-11 | AYT_2019_Ithaca-PLOT105 | NC14-23373 | 105 | 1 | 1 | 1 | 5 |
To add the trial(s) to the database:
You can view the trial(s) you added by finding them in the table on the Search > Field Trials page (you can filter the table by entering the trial name in the search box in the top right corner of the table).
You can open the trial detail page by clicking on the name of the trial in the table. From this page you can check the properties of the trial. The Field Layout Tools and Phenotype Heatmap section contains a generated field layout with the plot positions. The Experimental Design section contains information on the field design and the accessions used in the trial.
Use Case: You have a list of traits that were observed in your trials. Are these traits available in the database? How do you find the traits? What do you call the traits when uploading data?
Breedbase manages traits differently than we did with the original version of T3. Trait management may seem more complicated, but this allows us to more accurately and more precisely define the traits. In addition, trait observations can now be more easily compared to data stored in other databases that use the same trait definitions.
On the original T3, there was a T3 trait dictionary that was a list of trait names with descriptions that defined each individual trait. Trait observations were added to the database by linking each trait value with the T3 trait name.
On breedbase, traits are organized in a trait ontology, which is a controlled vocabulary of trait terms and definitions. The ontology is managed in a collaborative effort through The Generation Challenge Programme’s Crop Ontology project and is made publicly available online on their website. T3 has contributed trait definitions for all three of our crops:
The trait ontology is organized in a tree-like hierarchical structure. Under the root of the tree are trait categories - these are broad categories for grouping similar traits (such as Agronomic, Biotic Stress, Quality, etc). Directly under the trait categories are ontology trait terms, where a trait (in ontology terms) is the entity that is being measured (such as grain yield) but does not include any information about how it is measured or with what units. Directly under the trait is one or more ontology variable terms. The variable term contains information for a single trait as well as information about the method in which the trait was measured and the units/scale the value is recorded in. Observations made from a phenotyping trial must always be associated with a trait ontology variable.
To view the traits in the trait ontology, open the ontology browser by going to the Manage > Trait Ontology Browser page. This page will display all of the ontologies loaded into the database - the trait ontology will be labeled Wheat traits, Barley Traits, or Oat traits. Direct links to the trait ontology for each crop are:
Here you can navigate through the ontology tree to find the ontology variable that matches your trait.
Traits can also be searched by name by going to the Search > Traits page. Make sure the trait ontology is selected in the search settings and enter a term to search for.
From the results table, you can choose a term that has a type set to VARIABLE_OF
to associate with your trait data.
When loading your phenotype observations you’ll need to know both the trait name and trait id of the trait variable that matches your observed trait. For example, for wheat grain yield in kg/ha you’ll need to know:
If you’re familiar with the old trait names used on T3/Classic, you can use the lookup tables that link the old trait names to their corresponding breedbase trait variable names and IDs. There is a trait lookup table available for each crop:
If there is no corresponding trait variable that matches your observed trait, you can request to have a new trait added to the ontology. You can either contact us directly or use the trait request form.
If your trait only differs in the units/scale that was measured, we ask that you convert your values to match the scale of an existing trait variable.
Use Case: You have phenotype observations from one or more trials that have already been added to the database. How do you add the observations to the trials?
We’ll be adding phenotype observations to the new trials we created by creating a phenotype upload template. We’ll be creating the template manually, but a blank template with the proper trait column headers can be created by the website (from the trial detail page) if you have the traits you observed already in a list.
We’ll be using the simple phenotyping spreadsheet format. This format has one required column for the plot name followed by a column for each observed trait. The column headers are:
Below is an excerpt of the trait observations from the demo dataset:
observationunit_name | Grain yield - kg/ha|CO_321:0001218 | Grain test weight - g/l|CO_321:0001210 | FHB incidence - %|CO_321:0001149 | FHB severity - %|CO_321:0001440 | FHB DON content - ppm|CO_321:0001154 |
---|---|---|---|---|---|
AYT_2019_Ithaca-PLOT101 | 1984.58 | 739.01 | 53.15 | 8.91 | 1.13 |
AYT_2019_Ithaca-PLOT102 | 2044.19 | 756.15 | 35.59 | 8.08 | 1.75 |
AYT_2019_Ithaca-PLOT103 | 2356.95 | 787.56 | 36.02 | 20.78 | 1.52 |
AYT_2019_Ithaca-PLOT104 | 2061.00 | 752.54 | 69.92 | 20.64 | 2.25 |
AYT_2019_Ithaca-PLOT105 | 1535.94 | 769.60 | 78.59 | 9.40 | 1.38 |
Multiple trials can be uploaded at once with the same phenotype upload template.
To upload the phenotype template:
You can view the trait observations from the trial detail page (find your trial in the Search > Field Trials page and click the trial name to get to the trial detail page).
In the Phenotype Summary Statistics section you can view a table of trait means and summary statistics and a histogram of trait observations for the trial. Here you can also download a table of all of the trait observations for the trial.
Once you have your trial(s) up on the sandbox, you can submit them to be included on the production database.
From the trial detail page, there is a Submit Trial button near the top of the page. This will tell us that the trial is ready for submission. We will review it and add it to the production database.
There are tools available on the website that can help generate some of the templates we created manually.
Breedbase is tightly integrated with the Android Fieldbook app which can be used to collect phenotype data on an Android tablet or phone directly in the field. The website can be used to create the field layout files that are used to load your field trial into the app. The data files created by the app can then be loaded directly into Breedbase to store the phenotype observations.
If you’re using R to interact with your data, you can use the Breedbase R package to generate the upload templates.
This is a tool that was available on T3/Classic and ported over to breedbase. It can be used to generate LSMeans and LSDs for one or more traits across one or more trials. It is accessible from the Analyze > Summarize Trials menu.
To use the tool, you will first need to have the trials you are interested in summarizing in a list.
These tools are included with breedbase and are available in the Analysis Tools section of a trial detail page.
These tools are built in to breedbase and could be made available.
Some of these tools are not yet available because they require some work from us to get them up and running. If you are interested in using any of these tools, please reach out to us and let us know. This will allow us to prioritize setting up tools that we know people are interested in using.