Monarch geneset OGS2.0

DPOGS214860
TranscriptDPOGS214860-TA1941 bp
ProteinDPOGS214860-PA646 aa
Genomic positionDPSCF300091 + 61618-71708
RNAseq coverage132x (Rank: top 56%)
Annotation
HeliconiusHMEL0150120.085.76% 
BombyxBGIBMGA010070-TA0.077.67% 
DrosophilaCG6432-PA0.054.97% 
EBI UniRef50UniRef50_Q7PXM00.058.91%AGAP001473-PA n=4 Tax=Endopterygota RepID=Q7PXM0_ANOGA
NCBI RefSeqXP_969296.10.062.15%PREDICTED: similar to AGAP001473-PA [Tribolium castaneum]
NCBI nr blastpgi|910935810.062.15%PREDICTED: similar to AGAP001473-PA [Tribolium castaneum]
NCBI nr blastxgi|910935810.062.15%PREDICTED: similar to AGAP001473-PA [Tribolium castaneum]
Group
Gene OntologyGO:00081522.3e-82metabolic process
GO:00038242.3e-82catalytic activity
KEGG pathwaytca:6577610.0 
 K01908 (E6.2.1.17, prpE)maps-> Propanoate metabolism
InterPro domain[94-531] IPR0008732.3e-82AMP-dependent synthetase/ligase
Orthology groupMCL13921 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214860-TA
ATGGCTGAAGACGATGGAGCGGTCACTGAAGAATATCAGAAAGCCTATAGAAGTTCCCTTGAGAACCCTGAGAAGTTCTGGGGAGAGGTGGCCCAAGAGATCGAATGGACACGTCCTTGGGATCGTGTCTTGGACAACAGCAACCCACCTTTCACCAAATGGTTTGTGGGCGGGGAATTGTCTGTCTGTTACAATGCTGTAGACCGTCACGTGCCAACCAAAGGTGATCAAATCGCTTTGGCTTATGACTCCCCTTTGACTGACACCGTCAAGCGCATCACCTACGCCGAACTGCAGGATCAGGTATCTCGACTGGCTGGACGACTGTCTTCTCTGGGTGTAGAGCGAGGATCTCGAGTACTCATTTATATGCCGCTAATCCCAGAAGCTGTGGTGGCTATGTTGGCTACCAATCGGATAGGAGCCATACATTCAGTGGTTTTTGGAGGTTTTGCTGCAAGAGAGTTAAGAACAAGAATTGAACATGCGGAACCAACTGTTATCATAGCAGCGAGCTGTGGAGTGGAACCAAACAAAATAGTCAGGTACAAAGACATATTGGACGACGCACTATGTCAAAGTTCGCACACGCCTCAGAAGTGTATAATATACCAAAGACGGCGCGTACTAGAATGTTCCCTGCAGAAGGGAAGAGATATGTGCTGGGATGAAGCGCTGCAGGCGGAACCTGTTCCGTGTGTTCCAACTGAAGCTAATGAAGCGTTGTACATATTGTATACATCTGGTACCACGGATGATCCGAAGGGAATCCAACGGCCGTGTATGCATGCGGCTACGCTGTGTTGGTCAATGCGAGCTGTTTACGGTTTGAAGAGTGGTGTGTGGTGGGCTGCGTCGGATCTTGGATGGGTGGTCGGTCATTCCTATATATGTTACGGACCTCTGCTAGCAGGGCTTACAACCGTTCTGTATGAAGGGAAACCAGATCGAACTCCAGACCCTGGACAGTATTATAGGATTATAGAACAACATAGAGTCAACGCTCTCTTTACGATACCAACGGCTTTCCGCGTCCTCAAGAGGGCTGACCCCAATGGAAAATATGCTCGAAGATATTGCCAAAATACATTAAAGACGATATTCATTGCTGGAGAGCATTGTGATCACGAAACCAGACGCTGGGCTGAAAATATCTTCGGTGTATCTGTTCTGAACCACTGGTGGCAGACAGAGACAGGCTCCCCCATCACTGCCGCATGTCTTGGATATGGGATGAAAGGGATACGGCCACATTCAACCGGATATCCTGTGCCTGGATATGATTTACGTGCTCTGAGAGAGGATGGTACTGAATGTAAATCGGGAGAAGTTGGTCGTTTGGTTGCAAAACTACCTCTGCCTCCAGGATTCGCTTCAACATTGTGGCAATCAGATGAGCGGTTTAAGAAAGTATACTTCGACGCTTATCAGGGTTATTACGATACTCAAGATGTTGGTTGGATAAGCGCTGAATCTGCTGTTTGGGTTGTGGCTAGAGCTGATGACGTCATCAACGTCGCTGGCCACAGGCTCTCGACAGCCGCCATCGAAGATGTCGTTCTGAAGCACGCCAGGGTTGCGGATGCAGTGGTCGTCGGAGCCCCCGACCCGACTAAAGGGGATGTCCCTCTTTGCTTATACGTCATGCGTCCTCCGCAAGACGACGAAGAAATGGTGACGGAGAGCACGGTCACACAAGAGCTGATAGCGTTGGTGCGACACTTGATTGGACCGATTGCAGCCTTTCGGAAGGCTGTAGCTGTACCAGCACTGCCTCGCACACGATCTGGAAAGGCTTTGAGAGGAGCTATATCGAGACTGGCCAGGTGCCAGCAGATTAAGCTACCGGCTACCATCGAAGATCCAAGCGTGTTCGGCGAAATAAAGGTCGCATTGCAAAAATTTGGATACGCCATTGATGCACCGGACCCTGAAATGTAA

Protein sequence:

>DPOGS214860-PA
MAEDDGAVTEEYQKAYRSSLENPEKFWGEVAQEIEWTRPWDRVLDNSNPPFTKWFVGGELSVCYNAVDRHVPTKGDQIALAYDSPLTDTVKRITYAELQDQVSRLAGRLSSLGVERGSRVLIYMPLIPEAVVAMLATNRIGAIHSVVFGGFAARELRTRIEHAEPTVIIAASCGVEPNKIVRYKDILDDALCQSSHTPQKCIIYQRRRVLECSLQKGRDMCWDEALQAEPVPCVPTEANEALYILYTSGTTDDPKGIQRPCMHAATLCWSMRAVYGLKSGVWWAASDLGWVVGHSYICYGPLLAGLTTVLYEGKPDRTPDPGQYYRIIEQHRVNALFTIPTAFRVLKRADPNGKYARRYCQNTLKTIFIAGEHCDHETRRWAENIFGVSVLNHWWQTETGSPITAACLGYGMKGIRPHSTGYPVPGYDLRALREDGTECKSGEVGRLVAKLPLPPGFASTLWQSDERFKKVYFDAYQGYYDTQDVGWISAESAVWVVARADDVINVAGHRLSTAAIEDVVLKHARVADAVVVGAPDPTKGDVPLCLYVMRPPQDDEEMVTESTVTQELIALVRHLIGPIAAFRKAVAVPALPRTRSGKALRGAISRLARCQQIKLPATIEDPSVFGEIKVALQKFGYAIDAPDPEM-