<mods:mods version="3.3" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-3.xsd" xmlns:mods="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"><mods:titleInfo><mods:title>SpaTemHTP: A Data Analysis Pipeline for Efficient Processing and Utilization of Temporal High-Throughput Phenotyping Data</mods:title></mods:titleInfo><mods:name type="personal"><mods:namePart type="given">S</mods:namePart><mods:namePart type="family">Kar</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">V</mods:namePart><mods:namePart type="family">Garin</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">J</mods:namePart><mods:namePart type="family">Kholová</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">V</mods:namePart><mods:namePart type="family">Vadez</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">S S</mods:namePart><mods:namePart type="family">Durbha</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">R</mods:namePart><mods:namePart type="family">Tanaka</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">H</mods:namePart><mods:namePart type="family">Iwata</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">M O</mods:namePart><mods:namePart type="family">Urban</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:name type="personal"><mods:namePart type="given">J</mods:namePart><mods:namePart type="family">Adinarayana</mods:namePart><mods:role><mods:roleTerm type="text">author</mods:roleTerm></mods:role></mods:name><mods:abstract>The rapid development of phenotyping technologies over the last years gave the&#13;
opportunity to study plant development over time. The treatment of the massive&#13;
amount of data collected by high-throughput phenotyping (HTP) platforms is however&#13;
an important challenge for the plant science community. An important issue is to&#13;
accurately estimate, over time, the genotypic component of plant phenotype. In outdoor&#13;
and field-based HTP platforms, phenotype measurements can be substantially affected&#13;
by data-generation inaccuracies or failures, leading to erroneous or missing data. To&#13;
solve that problem, we developed an analytical pipeline composed of three modules:&#13;
detection of outliers, imputation of missing values, and mixed-model genotype adjusted&#13;
means computation with spatial adjustment. The pipeline was tested on three different&#13;
traits (3D leaf area, projected leaf area, and plant height), in two crops (chickpea,&#13;
sorghum), measured during two seasons. Using real-data analyses and simulations,&#13;
we showed that the sequential application of the three pipeline steps was particularly&#13;
useful to estimate smooth genotype growth curves from raw data containing a large&#13;
amount of noise, a situation that is potentially frequent in data generated on outdoor&#13;
HTP platforms. The procedure we propose can handle up to 50% of missing values. It&#13;
is also robust to data contamination rates between 20 and 30% of the data. The pipeline&#13;
was further extended to model the genotype time series data. A change-point analysis&#13;
allowed the determination of growth phases and the optimal timing where genotypic&#13;
differences were the largest. The estimated genotypic values were used to cluster the&#13;
genotypes during the optimal growth phase. Through a two-way analysis of variance&#13;
(ANOVA), clusters were found to be consistently defined throughout the growth duration.&#13;
Therefore, we could show, on a wide range of scenarios, that the pipeline facilitated&#13;
efficient extraction of useful information from outdoor HTP platform data. High-quality&#13;
plant growth time series data is also provided to support breeding decisions. The R&#13;
code of the pipeline is available at https://github.com/ICRISAT-GEMS/SpaTemHTP.</mods:abstract><mods:classification authority="lcc">Crop Physiology</mods:classification><mods:originInfo><mods:dateIssued encoding="iso8061">2020-11</mods:dateIssued></mods:originInfo><mods:originInfo><mods:publisher>Frontiers Media</mods:publisher></mods:originInfo><mods:genre>Article</mods:genre></mods:mods>