6. A miRNA Signature Predictive of Early Recurrence
Microarray de miRNA de Affymetrix
6
A microRNA Signature Associated with Early Recurrence
in Breast Cancer
Luis G. Pe´rez-Rivas1.
, Jose´ M. Jerez2.
, Rosario Carmona3
, Vanessa de Luque1
, Luis Vicioso4
,
M. Gonzalo Claros3,5
, Enrique Viguera6
, Bella Pajares1
, Alfonso Sa´nchez1
, Nuria Ribelles1
,
Emilio Alba1
, Jose´ Lozano1,5
*
1 Laboratorio de Oncologı´a Molecular, Servicio de Oncologı´a Me´dica, Instituto de Biomedicina de Ma´laga (IBIMA), Hospital Universitario Virgen de la Victoria, Ma´laga,
Spain, 2 Departamento de Lenguajes y Ciencias de la Computacio´n, Universidad de Ma´laga, Ma´laga, Spain, 3 Plataforma Andaluza de Bioinforma´tica, Universidad de
Ma´laga, Ma´laga, Spain, 4 Servicio de Anatomı´a Patolo´gica, Instituto de Biomedicina de Ma´laga (IBIMA), Hospital Universitario Virgen de la Victoria, Ma´laga, Spain,
5 Departmento de Biologı´a Molecular y Bioquı´mica, Universidad de Ma´laga, Ma´laga, Spain, 6 Departmento of Biologı´a Celular, Gene´tica y Fisiologı´a Animal, Universidad de
Ma´laga, Ma´laga, Spain
Abstract
Recurrent breast cancer occurring after the initial treatment is associated with poor outcome. A bimodal relapse pattern
after surgery for primary tumor has been described with peaks of early and late recurrence occurring at about 2 and 5 years,
respectively. Although several clinical and pathological features have been used to discriminate between low- and high-risk
patients, the identification of molecular biomarkers with prognostic value remains an unmet need in the current
management of breast cancer. Using microarray-based technology, we have performed a microRNA expression analysis in
71 primary breast tumors from patients that either remained disease-free at 5 years post-surgery (group A) or developed
early (group B) or late (group C) recurrence. Unsupervised hierarchical clustering of microRNA expression data segregated
tumors in two groups, mainly corresponding to patients with early recurrence and those with no recurrence. Microarray
data analysis and RT-qPCR validation led to the identification of a set of 5 microRNAs (the 5-miRNA signature) differentially
expressed between these two groups: miR-149, miR-10a, miR-20b, miR-30a-3p and miR-342-5p. All five microRNAs were
down-regulated in tumors from patients with early recurrence. We show here that the 5-miRNA signature defines a high-risk
group of patients with shorter relapse-free survival and has predictive value to discriminate non-relapsing versus early-
relapsing patients (AUC = 0.993, p-value,0.05). Network analysis based on miRNA-target interactions curated by public
databases suggests that down-regulation of the 5-miRNA signature in the subset of early-relapsing tumors would result in
an overall increased proliferative and angiogenic capacity. In summary, we have identified a set of recurrence-related
microRNAs with potential prognostic value to identify patients who will likely develop metastasis early after primary breast
surgery.
Citation: Pe´rez-Rivas LG, Jerez JM, Carmona R, de Luque V, Vicioso L, et al. (2014) A microRNA Signature Associated with Early Recurrence in Breast Cancer. PLoS
ONE 9(3): e91884. doi:10.1371/journal.pone.0091884
Editor: Sonia Rocha, University of Dundee, United Kingdom
Received November 11, 2013; Accepted February 14, 2014; Published March 14, 2014
Copyright: ß 2014 Pe´rez-Rivas et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits
unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by a grant from the Spanish Society of Medical Oncology (SEOM, to NR) and by grants from the Spanish Ministerio de
Economı´a, (SAF2010-20203 to J.L and TIN2010-16556 to J.J) and from the Junta de Andalucı´a (TIN-4026, to JJ). The funders had no role in study design, data
collection and analysis, decision to publish, or preparation of the manuscript.
Competing Interests: The authors have declared that no competing interests exist.
* E-mail: jlozano@uma.es
. These authors contributed equally to this work.
Introduction
Breast cancer comprises a group of heterogeneous diseases that
can be classified based on both clinical and molecular features [1–
5]. Improvements in the early detection of primary tumors and the
development of novel targeted therapies, together with the
systematic use of adjuvant chemotherapy, has drastically reduced
mortality rates and increased disease-free survival (DFS) in breast
cancer. Still, about one third of patients undergoing breast tumor
excision will develop metastases, the major life-threatening event
which is strongly associated with poor outcome [6,7].
The risk of relapse after tumor resection is not constant over
time. A detailed examination of large series of long-term follow-up
years, respectively, followed by a nearly flat plateau in which the
risk of relapse tends to zero [8–10]. A causal link between tumor
surgery and the bimodal pattern of recurrence has been proposed
by some investigators (i.e. an iatrogenic effect) [11]. According to
that model, surgical removal of the primary breast tumor would
accelerate the growth of dormant metastatic foci by altering the
balance between circulating pro- and anti-angiogenic factors
[9,11–14]. Such hypothesis is supported by the fact that the two
peaks of relapse are observed regardless other factors than surgery,
such as the axillary nodal status, the type of surgery or the
administration of adjuvant therapy. Although estrogen receptor
(ER)-negative tumors are commonly associated with a higher risk
In order to select the statistically significant and differentially
expressed miRNAs from Fig. 1, paired and multiple comparisons
among the prognosis groups A, B and C were performed. Two
different approaches, limma and RankProd Bioconductor, were
employed. Only those candidates with a fold change (FC).2
(either up- or down-regulated) and an adjusted p-value,0.05 were
selected (Table 2). Thus, comparison of the logFC and p-values
obtained with both limma and RankProd libraries led to the
identification of miR-149, miR-20b, miR-30a-3p, miR-342-5p,
downregulation in basal-like tumors. They also showed an inverse
relationship between the mitotic index and both miR-30a-3p and
miR-342-5p [76].
Differential expression of all six miRNAs were also determined
by RT-qPCR in the three prognosis groups (Table 2). With the
exception of miR-625, which could not be validated, miR-149,
miR-20b, miR10a, miR-30a-3p and miR-342-5p (the ‘‘5-miRNA
signature’’, from now on) were all confirmed to be down-regulated
in tumors from relapsing patients (groups B or C) when compared
Table 2. Most significant deregulated miRNAs in breast tumors from relapsing patients.
limma F* RankProd** RT-qPCR***
Comparison#
miRNA logFC adj-pval logFC adj-pval logFC SE
B/A hsa-miR-149 21.410 0.0016 21.615 ,0.00001 22.646 0.724
hsa-miR-20b 21.048 0.0071 21.237 ,0.00001 21.542 0.521
hsa-miR-30a-3p 21.359 0.0078 21.521 ,0.00001 21.001 0.514
hsa-miR-625 21.149 0.0014 21.377 ,0.00001 20.347 0.282
hsa-miR-10a 21.235 0.0168 21.547 ,0.00001 21.108 0.404
BC/A hsa-miR-149 21.120 0.0117 21.329 ,0.00001 22.555 0.681
hsa-miR-20b 21.016 0.0076 21.155 ,0.00001 21.470 0.536
hsa-miR-30a-3p 21.124 0.0256 21.326 ,0.00001 20.994 0.458
hsa-miR-625 21.003 0.0049 21.223 ,0.00001 20.266 0.237
B/AC hsa-miR-149 21.294 0.0052 21.446 ,0.00001 22.340 0.698
hsa-miR-10a 21.397 0.0093 21.647 ,0.00001 21.241 0.404
hsa-miR-342-5p 21.123 0.0159 21.254 ,0.00001 21.194 0.627
#
Group A = no recurrence, Group B = early recurrence (#24 months after surgery), Group C = late recurrence (50–60 months after surgery).
*limma F, analysis of filtered data (sd.70%) using limma.
**RankProd, analysis of unfiltered data using RankProduct algorithm.
***RT-qPCR, Relative miRNA expression was calculated using the DDCt method. The standard error (SE) was calculated based on the theory of error propagation [107].
doi:10.1371/journal.pone.0091884.t002
PLOS ONE | www.plosone.org 6 March 2014 | Volume 9 | Issue 3 | e91884
B
B
A
B
B
A
B
B
B
B
C
A
A
C
A
B
B
A
A
B
A
B
B
B
B
A
A
B
B
C
A
A
A
B
A
A
A
A
C
A
A
A
A
A
A
A
C
C
A
A
C
A
A
A
A
A
B
A
A
C
B
A
C
B
A
B
B
A
C
B
C
C
B
B
B
hsa−miR−10a_st
hsa−miR−149_st
hsa−miR−20b_st
hsa−miR−30a−star_st
hsa−miR−342−5p_st
Pérez-Rivas et al., Figure 2
-3
-2
-1
0
miR-10a
log2FoldChange
-3
-2
-1
0
miR-149
log2FoldChange
-3
-2
-1
0
miR-20b
log2FoldChange
-3
-2
-1
0
miR-30a-3p
log2FoldChange
-3
-2
-1
0
miR-342-5p
log2FoldChange
B vs A
BC vs A
B vs AC
A
B
COLABORACIÓN:
Emilio Alba
José M. Jerez
8. Siempre confirmamos con varios algoritmos
8
DEgenes Hunter - A Self-customised Gene Expression Analysis Workflow 315
Input (Count Data)
Data Filtering
Replicates 1 ?
Replicates 3 ?
DESeq2
edgeR
limma
NOISeq
DESeq2
DESeq2
edgeR
FUNCTIONAL ANALYSiS
topGO
Headmap and Clustering
Output
(Pdf Report)
YES
YES
NO
NO
Fig. 1. DEgenes Hunter main workflow
2 Methods
DEgenes Hunter - A Self-customised Gene Expression Analysis Workflow 317
GO:0003674
molecular_function
1.0000
225 / 41433
GO:0003824
catalytic activity
0.0012
128 / 19303
GO:0004347
glucose−6−phosphate ...
2.02e−11
7 / 22
GO:0004497
monooxygenase activi...
9.77e−11
15 / 294
GO:0005488
binding
0.9677
127 / 25778
GO:0008289
lipid binding
8.45e−16
29 / 797
GO:0016491
oxidoreductase activ...
3.08e−19
50 / 2066
GO:0016853
isomerase activity
3.28e−05
11 / 440
GO:0016860
intramolecular oxido...
1.68e−08
8 / 82
GO:0016861
intramolecular oxido...
4.79e−10
8 / 53
GO:0046906
tetrapyrrole binding
6.07e−11
16 / 335
GO:0097159
organic cyclic compo...
0.9982
57 / 14111
GO:1901363
heterocyclic compoun...
0.9981
57 / 14093
1 2 3 4 5 6
−1.5−1.0−0.50.00.51.01.5
sample
Samples
1.5
1.0
0.5
0.0
–0.5
–1.0
–1.5
Zscoreexpression
C1 C2 C3 T1 T2 T3
A
B
C
Samples
C1 C2 C3 T1 T2 T3
Fig. 2. Example analyses that can be performed with DEgenes Hunter on the ‘common
DEGs’ group. A: A GSEA analysis performed with topGO, where rectangle colour
represents the relative significance, ranging from dark red (most significant) to bright
yellow (least significant). B: A typical heatmap that can also be used as a quality
control to verify that control samples (C1, C2 and C3) and treatment samples (T1, T2
and T3) are grouped together. C: Expression clustering performed using cluster where
the genes have similar expression levels among control samples, and a clearly higher
value in treatment samples.
3.2 Performance Testing
Utility of ‘common DEGs’ group was confirmed comparing their FDR values.
Figure 3 shows that the FDR for ‘common DEGs’ is considerably lower than
for ‘complete DEGs’ and ‘non-common DEGs’ using separately any R package.
Since there is no clear way to set the threshold for qNOISeq [15], it is very high
in all cases.
DEgenes Hunter - A Self-customised Gene Expression Analysis Workflow 31
100/0 50/50 0/100
Fig. 4. Venn diagrams showing the numbers of DEGs found in synthetic data whe
different DEG ratios are used. 100/0 corresponds to all over-expressed/none repressed
50/50 is the balanced ratio, and 100/0 corresponds to none over-expressed/all re
pressed.
11. Bases de datos de genomas
10
Genetic and physical mapping of the QTLAR3 controlling
blight resistance in chickpea (Cicer arietinum L)
E. Madrid • P. Seoane • M. G. Claros •
F. Barro • J. Rubio • J. Gil • T. Milla´n
Received: 14 January 2014 / Accepted: 14 February 2014 / Published online: 26 February 2014
Ó Springer Science+Business Media Dordrecht 2014
Abstract Physical and genetic maps of chickpea a
QTL related to Ascochyta blight resistance and
located in LG2 (QTLAR3) have been constructed.
Single-copy markers based on candidate genes located
in the Ca2 pseudomolecule were for the first time
obtained and found to be useful for refining the QTL
position. The location of the QTLAR3 peak was linked
to an ethylene insensitive 3-like gene (Ein3). The Ein3
gene explained the highest percentage of the total
phenotypic variation for resistance to blight (44.3 %)
with a confidence interval of 16.3 cM. This genomic
region was predicted to be at the Ca2 physical position
32–33 Mb, comprising 42 genes. Candidate genes
located in this region include Ein3, Avr9/Cf9 and
Argonaute 4, directly involved in disease resistance
mechanisms. However, there are other genes outside
the confidence interval that may play a role in the
blight resistance pathway. The information reported in
this paper will facilitate the development of functional
markers to be used in the screening of germplasm
collections or breeding materials, improving the
efficiency and effectiveness of conventional breeding
methods.
Keywords Ascochyta blight Á CandidategenesÁ
Physical map Á Molecular markers
Introduction
Chickpea (Cicer arietinum L.) is a self-pollinated
diploid (2n = 2x = 16) annual grain legume widely
grown in arid and semi-arid areas across the six
continents. Together with other pulse crops, such as
lentil (Lens culinaris Medik.), dry pea (Pisum sativum
L.) and dry bean (Phaseolus vulgaris L.), chickpea is a
major source of protein in human diets, particularly in
low-income countries. In addition, chickpea crops
play an important role in the maintenance of soil
fertility, particularly in dry, rain-fed areas (Berrada
et al. 2007).
One of the most important factors contributing to
instability in chickpea yields is Ascochyta blight,
Electronic supplementary material The online version of
this article (doi:10.1007/s10681-014-1084-6) contains supple-
mentary material, which is available to authorized users.
E. Madrid () Á F. Barro
Institute for Sustainable Agriculture, CSIC, Apdo 4084,
14080 Co´rdoba, Spain
e-mail: b62mahee@uco.es
P. Seoane Á M. G. Claros
Departamento de Biologı´a Molecular y Bioquı´mica, y
Plataforma Andaluza de Bioinforma´tica, Universidad de
Ma´laga, 29071 Ma´laga, Spain
J. Rubio
A´ rea de Mejora y Biotecnologı´a, IFAPA Centro Alameda
del Obispo, Apdo 3092, 14080 Co´rdoba, Spain
J. Gil Á T. Milla´n
Departamento de Gene´tica, Universidad de Co´rdoba,
Campus Rabanales, Edif. C5, 14071 Co´rdoba, Spain
123
Euphytica (2014) 198:69–78
DOI 10.1007/s10681-014-1084-6
Genetic and physical mapping of the QTLAR3 controlling
blight resistance in chickpea (Cicer arietinum L)
E. Madrid • P. Seoane • M. G. Claros •
F. Barro • J. Rubio • J. Gil • T. Milla´n
Received: 14 January 2014 / Accepted: 14 February 2014 / Published online: 26 February 2014
Ó Springer Science+Business Media Dordrecht 2014
Abstract Physical and genetic maps of chickpea a
QTL related to Ascochyta blight resistance and
located in LG2 (QTLAR3) have been constructed.
Single-copy markers based on candidate genes located
in the Ca2 pseudomolecule were for the first time
obtained and found to be useful for refining the QTL
position. The location of the QTLAR3 peak was linked
to an ethylene insensitive 3-like gene (Ein3). The Ein3
gene explained the highest percentage of the total
phenotypic variation for resistance to blight (44.3 %)
with a confidence interval of 16.3 cM. This genomic
region was predicted to be at the Ca2 physical position
32–33 Mb, comprising 42 genes. Candidate genes
located in this region include Ein3, Avr9/Cf9 and
Argonaute 4, directly involved in disease resistance
mechanisms. However, there are other genes outside
the confidence interval that may play a role in the
blight resistance pathway. The information reported in
this paper will facilitate the development of functional
markers to be used in the screening of germplasm
collections or breeding materials, improving the
efficiency and effectiveness of conventional breeding
methods.
Keywords Ascochyta blight Á CandidategenesÁ
Physical map Á Molecular markers
Introduction
Chickpea (Cicer arietinum L.) is a self-pollinated
diploid (2n = 2x = 16) annual grain legume widely
grown in arid and semi-arid areas across the six
continents. Together with other pulse crops, such as
lentil (Lens culinaris Medik.), dry pea (Pisum sativum
L.) and dry bean (Phaseolus vulgaris L.), chickpea is a
major source of protein in human diets, particularly in
low-income countries. In addition, chickpea crops
play an important role in the maintenance of soil
fertility, particularly in dry, rain-fed areas (Berrada
et al. 2007).
One of the most important factors contributing to
instability in chickpea yields is Ascochyta blight,
Electronic supplementary material The online version of
this article (doi:10.1007/s10681-014-1084-6) contains supple-
mentary material, which is available to authorized users.
E. Madrid () Á F. Barro
Institute for Sustainable Agriculture, CSIC, Apdo 4084,
14080 Co´rdoba, Spain
e-mail: b62mahee@uco.es
P. Seoane Á M. G. Claros
Departamento de Biologı´a Molecular y Bioquı´mica, y
Plataforma Andaluza de Bioinforma´tica, Universidad de
Ma´laga, 29071 Ma´laga, Spain
J. Rubio
A´ rea de Mejora y Biotecnologı´a, IFAPA Centro Alameda
del Obispo, Apdo 3092, 14080 Co´rdoba, Spain
J. Gil Á T. Milla´n
Departamento de Gene´tica, Universidad de Co´rdoba,
Campus Rabanales, Edif. C5, 14071 Co´rdoba, Spain
123
Euphytica (2014) 198:69–78
DOI 10.1007/s10681-014-1084-6
SNP
SNP
12. BD de transcriptomas
11
De novo assembly of maritime pine transcriptome:
implications for forest breeding and biotechnology
Javier Canales1,†
, Rocio Bautista2,†
, Philippe Label3†
, Josefa Gomez-Maldonado1
, Isabelle Lesur4,5,6
,
Noe Fernandez-Pozo2
, Marina Rueda-Lopez1
, Dario Guerrero-Fernandez2
, Vanessa Castro-Rodrıguez1
,
Hicham Benzekri2
, Rafael A. Ca~nas1
, Marıa-Angeles Guevara7
, Andreia Rodrigues8
, Pedro Seoane2
,
Caroline Teyssier9
, Alexandre Morel9
, Francßois Ehrenmann4,5
, Gregoire Le Provost4,5
, Celine Lalanne4,5
, Celine
Noirot10
, Christophe Klopp10
, Isabelle Reymond11
, Angel Garcıa-Gutierrez1
, Jean-Francßois Trontin11
, Marie-Anne
Lelu-Walter9
, Celia Miguel8
, Marıa Teresa Cervera7
, Francisco R. Canton1
, Christophe Plomion4,5
, Luc Harvengt11
,
Concepcion Avila1,2
, M. Gonzalo Claros1,2
and Francisco M. Canovas1,2,
*
1
Departamento de Biologıa Molecular y Bioquımica, Facultad de Ciencias, Universidad de Malaga, Malaga, Spain
2
Plataforma Andaluza de Bioinformatica, Edificio de Bioinnovacion, Parque Tecnologico de Andalucıa, Malaga, Spain
3
INRA, Universite Blaise Pascal, Aubiere Cedex, France
4
INRA, Cestas, France
5
Universite de Bordeaux, Talence, France
6
HelixVenture, Merignac, France
7
Departamento de Ecologıa y Genetica Forestal, INIA-CIFOR, Madrid, Spain
8
Forest Biotech Lab, IBET/ITQB, Oeiras, Portugal
9
INRA, Unite Amelioration, Genetique et Physiologie Forestieres, Orleans Cedex 2, France
10
INRA de Toulouse Midi-Pyrenees, Auzeville, Castanet Tolosan cedex, France
11
FCBA, P^ole Biotechnologie et Sylviculture, Cestas, France
Received 20 July 2013;
revised 24 September 2013;
accepted 26 September 2013.
*Correspondence (Tel: +34 952131942;
fax: +34 952132376;
email: canovas@uma.es)
†
These authors contributed equally to work.
Summary
Maritime pine (Pinus pinaster Ait.) is a widely distributed conifer species in Southwestern
Europe and one of the most advanced models for conifer research. In the current work,
comprehensive characterization of the maritime pine transcriptome was performed using a
combination of two different next-generation sequencing platforms, 454 and Illumina.
De novo assembly of the transcriptome provided a catalogue of 26 020 unique transcripts in
maritime pine trees and a collection of 9641 full-length cDNAs. Quality of the transcriptome
assembly was validated by RT-PCR amplification of selected transcripts for structural and
regulatory genes. Transcription factors and enzyme-encoding transcripts were annotated.
Furthermore, the available sequencing data permitted the identification of polymorphisms and
Plant Biotechnology Journal (2014) 12, pp. 286–299 doi: 10.1111/pbi.12136
http://www.scbi.uma.es/sustainpinedb/
RESEARCH ARTICLE Open Access
De novo assembly, characterization and functional
annotation of Senegalese sole (Solea senegalensis)
and common sole (Solea solea) transcriptomes:
integration in a database and design of a
microarray
Hicham Benzekri1,2
, Paula Armesto3
, Xavier Cousin4,5
, Mireia Rovira6
, Diego Crespo6
, Manuel Alejandro Merlo7
,
David Mazurais8
, Rocío Bautista2
, Darío Guerrero-Fernández2
, Noe Fernandez-Pozo1
, Marian Ponce3
, Carlos Infante9
,
Jose Luis Zambonino8
, Sabine Nidelet10
, Marta Gut11
, Laureana Rebordinos7
, Josep V Planas6
, Marie-Laure Bégout4
,
M Gonzalo Claros1,2
and Manuel Manchado3*
Abstract
Background: Senegalese sole (Solea senegalensis) and common sole (S. solea) are two economically and
evolutionary important flatfish species both in fisheries and aquaculture. Although some genomic resources and
tools were recently described in these species, further sequencing efforts are required to establish a complete
transcriptome, and to identify new molecular markers. Moreover, the comparative analysis of transcriptomes will be
useful to understand flatfish evolution.
Results: A comprehensive characterization of the transcriptome for each species was carried out using a large set
of Illumina data (more than 1,800 millions reads for each sole species) and 454 reads (more than 5 millions reads
only in S. senegalensis), providing coverages ranging from 1,384x to 2,543x. After a de novo assembly, 45,063 and
38,402 different transcripts were obtained, comprising 18,738 and 22,683 full-length cDNAs in S. senegalensis and S.
solea, respectively. A reference transcriptome with the longest unique transcripts and putative non-redundant new
transcripts was established for each species. A subset of 11,953 reference transcripts was qualified as highly reliable
orthologs (97% identity) between both species. A small subset of putative species-specific, lineage-specific and
flatfish-specific transcripts were also identified. Furthermore, transcriptome data permitted the identification of single
nucleotide polymorphisms and simple-sequence repeats confirmed by FISH to be used in further genetic and expression
studies. Moreover, evidences on the retention of crystallins crybb1, crybb1-like and crybb3 in the two species of soles are
also presented. Transcriptome information was applied to the design of a microarray tool in S. senegalensis that was
successfully tested and validated by qPCR. Finally, transcriptomic data were hosted and structured at SoleaDB.
Conclusions: Transcriptomes and molecular markers identified in this study represent a valuable source for future
genomic studies in these economically important species. Orthology analysis provided new clues regarding sole
genome evolution indicating a divergent evolution of crystallins in flatfish. The design of a microarray and establishment
of a reference transcriptome will be useful for large-scale gene expression studies. Moreover, the integration of
Benzekri et al. BMC Genomics 2014, 15:952
http://www.biomedcentral.com/1471-2164/15/952
http://www.juntadeandalucia.es/
agriculturaypesca/ifapa/soleadb_ifapa/
14. AutoFlow: automatización de «workflows»
13
Figure 4
Time(hours)
Total_time
Euler_assembling_k_25
Euler_assembling_k_29
MIRA3_assembling
Euler_remove_artifacts_k_25
Euler_remove_artifacts_k_259
validate_contigs_with_mapping_k_25
validate_contigs_with_mapping_k_29
rescue_unmapped_contigs_k_25
rescue_unmapped_contigs_k_29
recover_MIRA3_debris
MIRA3_remove_artifacts
CAP3_reconciliation_k_25
CAP3_reconciliation_k_29
FLN_analysis_of_CAP3_contigs_k_25
FLN_analysis_of_CAP3_contigs_k_29
TIDs
choose_best_assembly+cp_best_assembly
AutoFlow, a Versatile Workflow Engine Illustrated by Assembling an
Optimised de novo Transcriptome for a Non-Model Species, such as Faba
Bean (Vicia faba)
Running title: AutoFlow, a versatile workflow engine
Pedro Seoane1
, Sara Ocaña2
, Rosario Carmona3
, Rocío Bautista3
, Eva Madrid4
,
Ana M. Torres2
, M. Gonzalo Claros1,3,*