Monarch geneset OGS2.0

DPOGS210462
TranscriptDPOGS210462-TA2187 bp
ProteinDPOGS210462-PA728 aa
Genomic positionDPSCF300062 + 314893-322215
RNAseq coverage835x (Rank: top 15%)
Annotation
HeliconiusHMEL0134100.086.52% 
BombyxBGIBMGA012640-TA7e-5286.67% 
DrosophilaBicD-PA2e-12144.80% 
EBI UniRef50UniRef50_E2A8B54e-12646.99%Protein bicaudal D n=10 Tax=Pancrustacea RepID=E2A8B5_CAMFO
NCBI RefSeqXP_970925.25e-12947.70%PREDICTED: similar to AGAP010206-PA [Tribolium castaneum]
NCBI nr blastpgi|1892391231e-12747.70%PREDICTED: similar to AGAP010206-PA [Tribolium castaneum]
NCBI nr blastxgi|3227983103e-12847.75%hypothetical protein SINV_80380 [Solenopsis invicta]
Group
Gene OntologyGO:00068102.1e-67transport
GO:00057942.1e-67Golgi apparatus
KEGG pathway 
InterPro domain[78-366] IPR0184772.1e-67Bicaudal-D protein, microtubule-associated
Orthology groupMCL10761 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210462-TA
ATGGCAGGGACGGCAGATAGTGATTTATCGATTTCCGAGCTGAAGGCGGAGATCGAGAGGCTTAGTCGAGAGCTTGACCAAGCCAGCAGCGAAAAGATTCAATCGGCACAGCTGGGTCTGGTGCTGTTGGAAGAGAAAAGTGCCTTGCAGCTGCGATGTGATGAGCTTGAAACTCTCTACGAAAACACGAAGCACGAACTGGACATCACACAAGAGGCTTTAATGAAACTAGACACAACACAAAAAGTAACTACACAGAGCGGTATAGAACAAGAAAATGCCCTTCTCAATGAATCTGCGGCTATGGAAAGTTCTCTCACATTACAAATTATTGAATTGGAAGGAGAGACGAAACAGTTACGACACGAATTAGAAAGAGTGGTCAGCGAACGAGATAGATTGTTGGCGGAGAGTTCAGAACTCGGAGCTGATAAAGCCACGAGGGAATCGGAGAGGGTGGCGTTAAGAGCTGAACTGAGAGAAGCGAGGCAGCGGGAACAGCGCTTGATAGTTGATATAGGTGATTTAGAAGATGAGAATATATCGCTACAGAAACAGGTCTCGGCGTTACGGTCGTCACAGGTAGAATTCGAAGGTCTGAAGCATGAGGTGCGTCAATTGCGAGAAGAAGCGGAGAATGCACGCGCAGCCGCTGATGAAACCGCAGCGCTGCGGCGTATCGCCGAGAGGCAGCTGGCGGAAGCGCTGGAAGCGCTGCAAGCGGAACGAGAAGCCAAATTTGCGGCCAAGAAAGAACTGGACGCCCACCTCAGCCGGGAGGCTCAGTTCAATATAACGAATCTGGCTTACAGTATACGAGGTATGCCAGAGGAAGGCGCTGATGATGAAGTGGAGGCGGGCGCGTCCGGCGGGGAGCTGGTGGCGGATCATCACGCGGACTTGTTCTCAGAGGTCCACCTGCACGAGATATCACGTCTAGAGAAACAACTGGAACAGGCGCATAACGAAAACTCACAGTTGTCGTCAGCACTGCGGGCGGCGCAGACGAGCGCGGAGACCGAGAGCGCGGCCGCCGGCGTGCTGCGGGCAGGACTCGCCAGGCTACTGTCCAGGGTCGCAGCACTGCACACGCTACACGGGGACTGCGCACCCCTGGAGGACGACAAGGTTGAAGGTGGCGTGGCAGCTCGGGCGGCCAAATGGTTGACGTGGTGGCGCGTGTCGGGCGGAGAGCTGGTGGCTCTGTTGGCGGCGCTTAAGGAACTAGACTCGCCACTTGCGGACGGGTCGCCGGCCGCCTTGCAAAGAGCACAGCTCGCACAGCTCAGCGATAGAATGGCAGACGCTGAAATAAGATGCGCCGCGCTGCAAGCTGACGCCGACCTGTTGAGGACACTCGCTGGAGGCGCCGGCCGCGCGCTCTCCTCCGCCGCCCCCGCGCTGGCGTCGGCCGCTGACACGCTGGCTCAGCTCTACCACCACGTGTGCGCCGTCAACGGCACTCAGCCCGAGCGTCTGCTGTTGGAGCACGCCGGACAAAACGACGTTACAGACGGGTCCGGTCGTGTGGATGAGGAGGCGTTGGCGCTGGCGGCGGGCGAGCTGGAGGGTCTGCGGGCGGCCGGCCTCGTGGCCAGGCATGCTGACACACTGCTCGACCAACTCACACACCTGAGGGCAGCACTGGACACCGCGCTGGACTCTAGACATAGACACCAACCAGAGACACCGTCCTACCCTTTCTTGAGAGCTACGAATTTTCTAACCAACCAAATGAAGAGATTTAAACTGATAAATATCGATAGTATTAACATCGAAGGAATGGAAAGTAAAAGATTCCACGGCTACCCTCCACAAGCAAATCGTGATTGCCAGTTAGTTTGCAACGAGCGTTCAGTTAAAGGGATCACTAATTTCTTTGAAAATATAGCGAGAGTCCATAGAAATCAATCAGAAACTAAAGATGAGTGTAGCGATATCACGAGAATAACTCCGTTCCTCTGCAAGCGCGCGCCTTTCAATGTGGAAAATAAAGCCACCTCCCTACCATGTGATTTCAACCTGCACGTGGCGTGGAGGCATTCAGTGTTACACGGGAAGAATTTTTATTTTAAGAAAGCAAATTTAGCACAGAATATGAAAGATTCTAAAAACGTTTTCAATGTAAGGGAGTCCTTGCGGGCGAGCGTGGTAGCGGGCATCACTATGTTGCAATACATTATGTGA

Protein sequence:

>DPOGS210462-PA
MAGTADSDLSISELKAEIERLSRELDQASSEKIQSAQLGLVLLEEKSALQLRCDELETLYENTKHELDITQEALMKLDTTQKVTTQSGIEQENALLNESAAMESSLTLQIIELEGETKQLRHELERVVSERDRLLAESSELGADKATRESERVALRAELREARQREQRLIVDIGDLEDENISLQKQVSALRSSQVEFEGLKHEVRQLREEAENARAAADETAALRRIAERQLAEALEALQAEREAKFAAKKELDAHLSREAQFNITNLAYSIRGMPEEGADDEVEAGASGGELVADHHADLFSEVHLHEISRLEKQLEQAHNENSQLSSALRAAQTSAETESAAAGVLRAGLARLLSRVAALHTLHGDCAPLEDDKVEGGVAARAAKWLTWWRVSGGELVALLAALKELDSPLADGSPAALQRAQLAQLSDRMADAEIRCAALQADADLLRTLAGGAGRALSSAAPALASAADTLAQLYHHVCAVNGTQPERLLLEHAGQNDVTDGSGRVDEEALALAAGELEGLRAAGLVARHADTLLDQLTHLRAALDTALDSRHRHQPETPSYPFLRATNFLTNQMKRFKLINIDSINIEGMESKRFHGYPPQANRDCQLVCNERSVKGITNFFENIARVHRNQSETKDECSDITRITPFLCKRAPFNVENKATSLPCDFNLHVAWRHSVLHGKNFYFKKANLAQNMKDSKNVFNVRESLRASVVAGITMLQYIM-