Monarch geneset OGS2.0

DPOGS203995
Transcript: DPOGS203995-TA | 4392 bp
Protein: DPOGS203995-PA | 1463 aa
Genomic position: DPSCF300005 + 1428453-1440763
RNAseq coverage: 379x (Rank: top 32%)
Annotation
Heliconius: HMEL013866 | 0.0 | 89.24%
Bombyx: BGIBMGA002140-TA | 0.0 | 92.81%
Drosophila: DIP2-PA | 0.0 | 62.76%
EBI UniRef50: UniRef50_UPI000206079E | 0.0 | 72.37% | UPI000206079E related cluster n=2 Tax=unknown RepID=UPI000206079E
NCBI RefSeq: XP_002432341.1 | 0.0 | 75.15% | disco-interacting protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|270004770 | 0.0 | 76.56% | hypothetical protein TcasGA2_TC010545 [Tribolium castaneum]
NCBI nr blastx: gi|270004770 | 0.0 | 76.98% | hypothetical protein TcasGA2_TC010545 [Tribolium castaneum]
Group
Gene Ontology: GO:0008152 | 9e-48 | metabolic process
GO:0003824 | 9e-48 | catalytic activity
KEGG pathway: bte:BTH_II0276 | 8e-26
 K01913 (E6.2.1.-) maps-> Benzoate degradation via CoA ligation
    Propanoate metabolism
    Caprolactam degradation
    Limonene and pinene degradation
    Tropane, piperidine and pyridine alkaloid biosynthesis
InterPro domain: [880-1382] | IPR000873 | 9e-48 | AMP-dependent synthetase/ligase
Orthology group: MCL10982 | Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203995-TA
ATGTACGCCGCTTTTGACTTGAACATGGTACAAGTGAGGCAGGAGGCAGTACAACAGGCGCTCGCTGAAATGCAGAACAGACCTAAACCTTCACTTCCGATGCCATCGAAGAGAACCTCTATGATGGCCAAAAGTCCTGATAGAGAAAGACATGATCTGTCATCGAGTTCAGATGAAGATTCTTGCGCTGGTGACAGTGAGTTGCCCCCTCCTCCAGAGTTGGGTTCACCACCAGCTAGACGACCAGCCGCACCTCATCCACGATCCCACCCACACCCACTGCCACAACCGCACCACCAGCGTGAAAAGAAACATGGTCTCAGGGACAAGACGGAAATAGACTTGTCAGACATCACACATCTACCGGCTTACATCCAACCGGACGTTACGCACACTACGTCGGGGGCCCGGCGTGGTGGAGCTCATATAGCTGACCGTGTTCAGTGTTACGCTCAACCCGAAGACACTGGCACCGGCACCGGCAGGTGGAAGGTATCAGCCAAAATACAACAACTTCTCAACACCTTAAAGCGCCCGAAACGTCGACCGCTGCCGGAGTTCTATGAAGACGATGACATTGAATTAGAAATCGCTGCGAACCCCAAAGATCCGAACGCCCCCAAACCCGAGGGAGGCACAATGACCCCGGCTGTGGGCGAGCAACTTGTGGTACCCGCGGGACTCCCACGAAACCTTGAAGCAGCTTTACAAAGATATGGAACAGCATCATTCAAAGCCAACGTAGCTACAGTGCTCGATCCAAACGGAAAACTGAGCAATTCACTTACCTATGGCAAGTTGCTAAGCCGCTCACTAAAAATTGCTCACGCTTTGCTGAACAAAACGTTCACATCGAAAACTAGCAGCGGCGGACCGCTGACGGGGGATAATTCCATAAAACCGGGAGACAGGGTGGCTCTGGTGTACCCTAACAATGATCCTATAAACTTCATGTGTGCGTTCTACGGTTGCCTGCAAGCCGGCATAGTGCCAGTGCCGATTGAAGTGCCATTGACGAGACGGGACGCCGGCCTTCAGCAAGTCGGCTTCCTGTTGGGATCTTGTGGTATACAATATGCACTCACGTCTGACAACTGCTTGAAAGGGTTACCGAAAACGTCATCGGGGGACGTGGTATCCTTCCGTGGCTGGCCGTCACTGCAGTGGGTGTCGACAGAGAAGTTGCAAAGACCTCCTCGAGACTGGATCCCACCACCGCGCCCCGCCGAAGACTCCCCAGCTCATATAGAACACACCTCCGCCGCTGATGGTTCAGCGATGGGAGTCATCGTTACAAGGTCGTCAATGTTGGCACATAGTCGTATGTTGTCCGTCGCCTGCAACTACACCGAAGGCGAACACATGGTGTGCGTGTTGGACTTCAAACGTGAAACTGGTCTCTGGCACGCTGTGTTAGCCAGCGTTCTCAATGGAATGCACGTTATATTCATACCATACGCCCTCATGAAAGTCAGCCCGGCCTCCTGGATGCACATGATCACTAAATACAGGGCGTCGGTCGCGATTGTGAAGTCACGTGATCTCCACTGGGGCCTATTGGCGACTCGTGACCACAAGGAGATATCTCTCAGCTCGTTGCGCATGTTATTGGTCGCTGATGGAGCTAACCCTTGGTCTCTATCTTCGTGTGACCAGTTCCTATCAATATTCCAAAGTAAAGGTGTTCGCGGCGACGCTATCTGTCCGTGTGCATGCAGCAGTGAGTCGTTGACGGTTTGCGTGCGGCGTGCGGGACGAGGGGGAGCAGCAGCTGGCCGGGGCGTGCTGTCCATGTCCGGACTCTCATACGGAGTGGTTCGAGTAGACGCTGAGAACTCCCTTACCTCACTCACACTTCAAGACTGCGGACAAGTTATGCCATCCTGTGTAATAGTGGTAGTGAAGATGGAGGGTCCCGCCTATCTATGCAAGACCGACGAAGTGGGGGAGATATGTGTATTGTCTGGAGCCACTGGATCGGGTTACTGGGGACTACCCGG
TTTAACTAACACTGTGTTTAGAGTACAGCCGTTGGATGCTGACGGGGAGCCTATTGGAGAGGAACATTATGTTAGAAGCGGGTTGCTAGGTTTCCTTGGCCCAGGAGGTTTAGTATTTGTTTGCGGTTCTCGTGATGGTCTTATGACGGTTACTGGCAGAAAACACAACATGGACGATATAATCGCAACTGTTCTTGCTGTGGAACCAATGAAGTTCATATACAGGGGTCGTATAGCAGTGTTCTCTGTTCGAGTTTTGCGTGATGAACGGATATGTATAGTGGCGGAACAAAGGCCAGATTGTGGGGAAGAAGAGTCATTCCAATGGATGTCTCGCGTTTTGCAAGCCGTTGATTCTATTCACCAAGTGGGTATATATTGTTTGGCACTTGTGCAACCAAACTATCTACCCAAAACACCCCTCGATGTTGGACCAGCTTCGGTCATAGTTGGAAATTTAGTACAAGGAAATCGTCTAGCGTCTGCCCAGGGTCGGGATATGGGATATAGTGATGATTCTGATGCTGCTAGAAAATATCAGTTTATATCACAAATATTGAGGTGGCGCGCTCAGAGCACATCCGACCATGTCATATTCACATTGCTTAATTCCAAGGGTGCTGTATCAAAAGTACTGACATGTGCCGAATTGCATAAAAAAGCGGAAAGAATCGGAAATCTATTGCTAGAGAAAGGCAGAGTGAACACCGGAGACCATGTAGCTTTGATATTTCCACCGGGGCTAGATCTCATTTGCGCATTCTACGGCTGTTTGTACGTAGGTGCTGTTCCCGTTACAATTAGACCACCACATCCTCAGAACCTCCATACCACGCTACCAACAGTACGCATGATAGTAGATGTGAGCAAAGCTACTTTGATCCTTTCCAATCAATCTGTGATAAAATTGCTCAGATCAAAAGAAGCTAGCAACGTTCTCGATAGCAAGGCGTGGCCTATTACACTTGATACAGATGACGTTCCTAAGAAGAAATTACCAATATTATACCGAGCTCCTACAGCAGAAATGCTTGCTTATTTGGACTTCAGTGTTTCTACTACCGGGATGTTGGCTGGTATTAAGATGTCTCATGCAGCTGTCACTTCCCTCTGCCGTTCAATGAAAATCGCTTGTGAATTATACCCCTCTAGACACATAGCACTCTGCTTGGACCCCTACTGCGGACTTGGATTTGCTCTATGGTGCTTGAGTAGCATCTACTCCGGTCACCATTCTATTCTTATTCCTCCATCAGAAGTGGAAATTAACCCCGCCTTATGGCTTAGTGCTGTATCTCAATATAAAGTACGTGACACATTCTGTTCTTACGGTGTAATGGAATTGTGTACCAAAGGCCTCGGCAGCTCGGTTAACCAGCTTAAAGCGAAAGGAATTAATTTGGCATGCGTTCGAACGTGTGTTGTAGTTGCAGAAGAACGACCACGAATTAATTTGACAAACTCATTTTCGAAGTTATTCTCAGCACTTGGCCTTACTCCACGTGCGGTGTCCACATCGTTCGGTTGTCGTGTCAACATAGCCATCTGCCTCCAAGGTGCATCAAGTCCTGAACCGTCTACAGTTTATGTCGATCTAAGAGCATTGCGTAATGATCGTGTCTCGTTAGTGGAACGCGGGAGCCCTCACTCTCTTTGCCTTATGGAGTCCGGCAAATTGTTGCCAGGAGTTAAAGTAATCACAGCTAATCCTGAAACTAAAGGCCAGTGCGGGGATTCCCATTTAGGTGAAATATGGGTGCAGTCACCTCATAATGCTAGCGGCTACTTCACGATATATGGTGATGAGAGTGACTATGCTGACCATTTTAGTGCTCAATTAGTTACCGGTAACACTGGGGAGGTTTACGCTAGGACCGGGTACCTCGGTTTTTTGCGACGAACTGAAATCAGTACGACGAACGCATCTGACGATACGTCTCTATTGGCACGAGACAGTGACACAGAGTCTATGTTATCTGGATGCGGCAGTGTGTCTGGTC
TAACTGACACACACGACACACACGACGCTGTGTTTGTGGTGGGAGCTCTTGATGAAACAATCATGTTACGCGGTATGAGATACCATCCGATCGATATCGAAAATTCAGTTATGAGGTGCCATAAAAAAATTGCTGAATGCGCTGTTTTTACATGGACAAATCTGCTGGTGGTAGTTGTAGAGTTGGACGGTAACGACAGCGAAGCTTTGAATCTGGTGCCCCTTGTGACGAATACTGTGCTAGAGGAGCACCATCTAATTGTTGGAGTCGTGGTGGTGGTAGACCCCGGAGTGGTGCCCATCAACTCTAGAGGAGAGAAACAACGCATGCACCTCCGTGACGGATTCCTCTCCGATCAGATTGATGCTATATACATAGCTTACAACATGTAA

Protein sequence:

>DPOGS203995-PA
MYAAFDLNMVQVRQEAVQQALAEMQNRPKPSLPMPSKRTSMMAKSPDRERHDLSSSSDEDSCAGDSELPPPPELGSPPARRPAAPHPRSHPHPLPQPHHQREKKHGLRDKTEIDLSDITHLPAYIQPDVTHTTSGARRGGAHIADRVQCYAQPEDTGTGTGRWKVSAKIQQLLNTLKRPKRRPLPEFYEDDDIELEIAANPKDPNAPKPEGGTMTPAVGEQLVVPAGLPRNLEAALQRYGTASFKANVATVLDPNGKLSNSLTYGKLLSRSLKIAHALLNKTFTSKTSSGGPLTGDNSIKPGDRVALVYPNNDPINFMCAFYGCLQAGIVPVPIEVPLTRRDAGLQQVGFLLGSCGIQYALTSDNCLKGLPKTSSGDVVSFRGWPSLQWVSTEKLQRPPRDWIPPPRPAEDSPAHIEHTSAADGSAMGVIVTRSSMLAHSRMLSVACNYTEGEHMVCVLDFKRETGLWHAVLASVLNGMHVIFIPYALMKVSPASWMHMITKYRASVAIVKSRDLHWGLLATRDHKEISLSSLRMLLVADGANPWSLSSCDQFLSIFQSKGVRGDAICPCACSSESLTVCVRRAGRGGAAAGRGVLSMSGLSYGVVRVDAENSLTSLTLQDCGQVMPSCVIVVVKMEGPAYLCKTDEVGEICVLSGATGSGYWGLPGLTNTVFRVQPLDADGEPIGEEHYVRSGLLGFLGPGGLVFVCGSRDGLMTVTGRKHNMDDIIATVLAVEPMKFIYRGRIAVFSVRVLRDERICIVAEQRPDCGEEESFQWMSRVLQAVDSIHQVGIYCLALVQPNYLPKTPLDVGPASVIVGNLVQGNRLASAQGRDMGYSDDSDAARKYQFISQILRWRAQSTSDHVIFTLLNSKGAVSKVLTCAELHKKAERIGNLLLEKGRVNTGDHVALIFPPGLDLICAFYGCLYVGAVPVTIRPPHPQNLHTTLPTVRMIVDVSKATLILSNQSVIKLLRSKEASNVLDSKAWPITLDTDDVPKKKLPILYRAPTAEMLAYLDFSVSTTGMLAGIKMSHAAVTSLCRSMKIACELYPSRHIALCLDPYCGLGFALWCLSSIYSGHHSILIPPSEVEINPALWLSAVSQYKVRDTFCSYGVMELCTKGLGSSVNQLKAKGINLACVRTCVVVAEERPRINLTNSFSKLFSALGLTPRAVSTSFGCRVNIAICLQGASSPEPSTVYVDLRALRNDRVSLVERGSPHSLCLMESGKLLPGVKVITANPETKGQCGDSHLGEIWVQSPHNASGYFTIYGDESDYADHFSAQLVTGNTGEVYARTGYLGFLRRTEISTTNASDDTSLLARDSDTESMLSGCGSVSGLTDTHDTHDAVFVVGALDETIMLRGMRYHPIDIENSVMRCHKKIAECAVFTWTNLLVVVVELDGNDSEALNLVPLVTNTVLEEHHLIVGVVVVVDPGVVPINSRGEKQRMHLRDGFLSDQIDAIYIAYNM-
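
The transcript and protein lengths in this record are mutually consistent: 4392 nt = 3 x (1463 aa + 1 stop codon), and the protein record marks the stop with a trailing "-". The Python sketch below shows one way to check that relationship for FASTA records like the two above. It is illustrative only and not part of the database: the function names `read_fasta` and `translate` are hypothetical, the standard genetic code is assumed, and the `toy-TA` record is a made-up example built from the first four codons of DPOGS203995-TA plus a stop codon.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


# Standard genetic code (assumed), built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol to match the record's notation;
    unrecognised codons become "X"; trailing bases that do not fill a codon
    are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Hypothetical toy record: first four codons of the transcript plus a stop,
# wrapped over two lines the way the full sequence is wrapped above.
toy = read_fasta(">toy-TA\nATGTACGCC\nGCTTAA\n")
toy_protein = translate(toy["toy-TA"])

# Length relationship stated in the record header.
lengths_agree = 4392 == 3 * (1463 + 1)
```

Applied to the full records, the same check would compare `translate(records["DPOGS203995-TA"])` with the DPOGS203995-PA sequence, provided both FASTA blocks are read from one text.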