An official website of the United States government.

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

The USDA-ARS Ag100Pest Initiative: High-Quality Genome Assemblies for Agricultural Pest Arthropod Research.

    Summary
    Publication Type
    Journal Article
    Abstract

    The phylum Arthropoda includes species crucial for ecosystem stability, soil health, crop production, and others that present obstacles to crop and animal agriculture. The United States Department of Agriculture's Agricultural Research Service initiated the Ag100Pest Initiative to generate reference genome assemblies of arthropods that are (or may become) pests to agricultural production and global food security. We describe the project goals, process, status, and future. The first three years of the project were focused on species selection, specimen collection, and the construction of lab and bioinformatics pipelines for the efficient production of assemblies at scale. Contig-level assemblies of 47 species are presented, all of which were generated from single specimens. Lessons learned and optimizations leading to the current pipeline are discussed. The project name implies a target of 100 species, but the efficiencies gained during the project have supported an expansion of the original goal and a total of 158 species are currently in the pipeline. We anticipate that the processes described in the paper will help other arthropod research groups or other consortia considering genome assembly at scale.

    Citation
    Childers AK, Geib SM, Sim SB, Poelchau MF, Coates BS, Simmonds TJ, Scully ED, Smith TPL, Childers CP, Corpuz RL, Hackett K, Scheffler B. The USDA-ARS Ag100Pest Initiative: High-Quality Genome Assemblies for Agricultural Pest Arthropod Research.. Insects. 2021 Jul 09; 12(7).
    Publication Date
    2021 Jul 09
    DOI
    10.3390/insects12070626
    Authors
    Childers AK, Geib SM, Sim SB, Poelchau MF, Coates BS, Simmonds TJ, Scully ED, Smith TPL, Childers CP, Corpuz RL, Hackett K, Scheffler B
    Cross Reference
    Database Reference Annotations
    Database Accession
    PMID 34357286
    Analyses
    Name Program Status
    Tribolium castaneum genome assembly icTriCast1.1 (GCF_031307605.1) HiFiASM Current
    Schistocerca americana genome assembly iqSchAmer2.1 (GCF_021461395.2) HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11 Current
    Anabrus simplex genome assembly ASM4041472v1 (GCF_040414725.1) HiFiASM Current
    Ceratitis capitata genome assembly Ccap_2.1 (GCF_000347755.3) NA Current
    Ornithodoros turicata genome assembly ASM3712646v1 (GCF_037126465.1) HiFiASM Current
    Neodiprion fabricii genome assembly iyNeoFabr1.1 (GCF_021155785.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11 Current
    Helicoverpa zea genome assembly ilHelZeax1 (GCF_022581195.2) FALCON v. 1.8.1; FALCON-Unzip v. 1.3.7; purge_dups v. 1.2.5; bwa-mem v. 2.2.1; Juicebox v. 1.11.08; Arrow gcpp v. 2.0.2; FreeBayes v. 1.0.2; Merqury v. 1.1 Current
    NCBI Anabrus simplex Annotation Release GCF_040414725.1-RS_2024_09 NCBI Eukaryotic Genome Annotation Pipeline Current
    Plodia interpunctella genome assembly ilPloInte3.2 (GCF_027563975.2) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08 Current
    Zeugodacus cucurbitae genome assembly idZeuCucr1.2 (GCF_028554725.1) HiFiASM Current
    Bactrocera dorsalis genome assembly ASM2337382v1 (GCF_023373825.1) NextDenovo Current
    Schistocerca gregaria genome assembly iqSchGreg1.2 (GCF_023897955.1) HiFiASM Current
    Neodiprion lecontei genome assembly iyNeoLeco1.1 (GCF_021901455.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11 Current
    Schistocerca piceifrons genome assembly iqSchPice1.1 (GCF_021461385.2) HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11 Current
    Schistocerca nitens genome assembly iqSchNite1.1 GCF_023898315.1 HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11 Current
    Schistocerca serialis cubense genome assembly iqSchSeri2.2 (GCF_023864345.2) HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11 Current
    Anthonomus grandis grandis genome assembly icAntGran1.3 (GCF_022605725.1) HiFiASM Current
    Diprion similis genome assembly iyDipSimi1.1 (GCF_021155765.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11 Current
    Bombus huntii genome assembly iyBomHunt1.1 (GCF_024542735.1) HiFiASM Current
    Schistocerca cancellata genome assembly iqSchCanc2.1 GCF_023864275.1 HiFiASM v. 0.15.4; 3D-DNA v. 210817; Juicebox Assembly Tools v. 1.11 Current
    Dermacentor andersoni genome assembly qqDerAnde1.2 (GCF_023375885.1) HiFiASM Current
    Aethina tumida genome assembly icAetTumi1.1 (GCF_024364675.1) HiFiASM Current
    Pectinophora gossypiella genome assembly ilPecGoss1.1 (GCF_024362695.1) FALCON v. 1.8.1; FALCON-Unzip v. 1.3.7; purge_dups v. 1.2.5; bwa-mem v. 2.2.1; YaHS v. 1.0; Juicebox v. 1.11.08; Arrow gcpp v. 2.0.2; FreeBayes v. 1.0.2; Merqury v. 1.1 Current
    Anastrepha ludens genome assembly idAnaLude1.1 (GCF_028408465.1) HiFiASM Current
    Anastrepha obliqua genome assembly idAnaObli1_1.0 (GCF_027943255.1) HiFiASM Current
    Vespa mandarinia genome assembly V.mandarinia_Nanaimo_p1.0 (GCF_014083535.2) IPA Current
    Diorhabda carinulata genome assembly icDioCari1.1 (GCF_026250575.1) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08 Additional genomes Browse all Diorhabda carinulata genomes (3) BioProject PRJNA788877 Diorhabda carinulata genome sequencing, Current
    Microplitis mediator genome assembly iyMicMedi2.1 (GCF_029852145.1) HiFiASM Current
    Microplitis demolitor genome assembly iyMicDemo2.1a (GCF_026212275.2) HiFiASM Current
    Diorhabda sublineata genome assembly icDioSubl1.1 (GCF_026230105.1) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08 Current
    Cylas formicarius genome assembly icCylForm1.1 (GCF_029955315.1) HiFiASM Current
    Cydia pomonella genome assembly ilCydPomo1 (GCF_033807575.1) FALCON Current
    Amyelois transitella genome assembly ilAmyTran1.1 (GCF_032362555.1) HiFiASM Current
    Diachasmimorpha longicaudata genome assembly iyDiaLong2 (GCF_034640455.1) hifiasm Current
    Vanessa tameamea genome assembly ilVanTame1 primary haplotype (GCF_037043105.1) HiFiASM Current
    Ceratitis capitata genome assembly Ccap_2.1 (GCF_000347755.3) AllPaths v. 35218; ATLAS-link v. 1.0; ATLAS-gapfill v. 2.2; redundans v. 0.12c Current
    Neodiprion pinetum genome assembly iyNeoPine1.1 (GCF_021155775.1) HiFiASM v. 0.16.1-r375; Juicebox Assembly Toolkit v. 1.11 Supressed
    Plodia interpunctella genome assembly ilPloInte3.1 (GCF_027563975.1) HiFiAdapterFilt v. 2.0.0; HiFiASM v. 0.16.1; YaHS v. 1.1; Juicebox v. 1.11.08 Supressed