Monarch geneset OGS2.0

DPOGS214338
TranscriptDPOGS214338-TA1674 bp
ProteinDPOGS214338-PA557 aa
Genomic positionDPSCF300020 - 437110-438783
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0200420.085.46% 
BombyxBGIBMGA004140-TA0.085.46% 
DrosophilaCG3267-PA0.065.36% 
EBI UniRef50UniRef50_F6T8820.065.02%Uncharacterized protein n=42 Tax=root RepID=F6T882_MONDO
NCBI RefSeqXP_001664206.10.067.24%Methylmalonyl-CoA carboxyltransferase 12S subunit, putative [Aedes aegypti]
NCBI nr blastpgi|1571382700.067.24%Methylmalonyl-CoA carboxyltransferase 12S subunit, putative [Aedes aegypti]
NCBI nr blastxgi|1571382700.067.24%Methylmalonyl-CoA carboxyltransferase 12S subunit, putative [Aedes aegypti]
Group
Gene OntologyGO:00168744.5e-150ligase activity
KEGG pathwayaag:AaeL_AAEL0139670.0 
 K01969 (E6.4.1.4B)maps-> Valine, leucine and isoleucine degradation
InterPro domain[68-546] IPR0000224.5e-150Carboxyl transferase
Orthology groupMCL13255 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214338-TA
ATGCTCACGTTACTTAAAGGCGTACGGCGATTAAATGTCCAAAGCCGGCATTACAGTTACGCCAACGTGATAGGAAGTGAGCCGAATAGAAATGATCCTTATTACGCTGAGAACAAGGAGAAAATGGACGAGCTCGTGAGGGAACTCCGTCAGAAAACAAATGACGCTATCAAAGGCGGTCCCGAGGAAGCAGTCAAAAGACATATTGCTAGAGGAAAACTGCTTGTAAGAGACAGGATTAACCGGTTAGTGGACGATGGAAGTGACGTCTTAGAATTAAGTACTCTGGCCGCTACGGACATGTACAAAGGTCAGGTCCCGGGCGCCGGCATAGTTACAGCTATCGGCCAAGTTCAAGGTCGGGATTGTATGATTGTTGCCAATGATGCTACTGTTAAAGGAGGCACGTATTTTCCTATGACCATTAAAAAGCATCTAAGGGCACAAGAAATCGCTAAGGAATGTCGTCTACCTTGCATTTATCTCGTTGATTCTGGTGGCGCCCATTTGCCTGATCAAGCGGACGTATTCCCTGACAGAGAACATTTCGGTAGAATATTTTTTAATCAGGCTAATATGTCTGCTGAAGGTATCCCTCAAATTTCTGTGGTGATGGGATCTTGCACCGCCGGTGGCGCTTATATCCCTAGCATGTCTGACGAAAGTATCATTGTTAAAAAGCAGGGTACAATTTTCTTAGCTGGTCCACCGCTTGTTAAAGCTGCTACTGGAGAAACAGTTTCAGCGGAAGATTTAGGTGGGGCAGACCTACATTGCCGTAAGTCAGGTGTCACAGATCATTACGCATTAGATGACGAGCACGCCTTGCAATTGGCTAAAAATGTGGTTGCTAATCTAAACTGGCATAATGACCAAAGAATTCGAGTTTATGCATCGGATATAGACGAACCATTACATGATATCGATGACTTACACGGAATAGTTGGCGCGAATTTGCAAAGACCTTTTGACATCCGAGAAGTTATTGCTAGAATAGTTGACGGAAGCAGGTTCCATGAATTCAAACAACTTTATGGTGAGACTTTAGTATGTGGGTTTGCATCTGTTTATGGTAATCCTGTCGGAGTTTTAGGCAATAACGGCGTTTTACATTCAGAAGCTGCCTTAAAAGGAGCTCATTTTATTCAACTATGTGCTGCGCGAAAAATACCCTTACTGTTTCTACAAAACATCACTGGTTTCATGGTAGGTCCAGAAGCCGAGGCTGGAGGTATCGCTAAAAATGGTGCCAAATTGGTCACAGCTGTGAGTTGCTTTAAGGGGCCCAAAGTTACAGTACTAGTTGGTGGCAGTTTCGGTGCTGGTAACTATGGTATGTGTGGCAGAGCATATTCACCAAGTTTCCTTTATATGTGGCCGAATGCCAGAATATCTGTGATGGGTGGTCCACAAGCTGCTACAGTGCTTTCATTAGTTGCAAAAGAAAAGGCGAACAGGGAAAAGAAAGAGTGGACAGAAGAAGATGAAAAGAAAGTGAAAGATCCATTGGAAGCGAAGTTTGATCTTGAAGGCCGACCATATTACAGTACAGCTCGTTTATGGGATGATGGGATCTTGGCCCCGAAGGATACCAGGAAAGTAGTGGGTCTGAGTATATCAGCGGCACTAAACGCACCATTTAGGGACAGTAAGTTTGGCATATTCAGAATGTGA

Protein sequence:

>DPOGS214338-PA
MLTLLKGVRRLNVQSRHYSYANVIGSEPNRNDPYYAENKEKMDELVRELRQKTNDAIKGGPEEAVKRHIARGKLLVRDRINRLVDDGSDVLELSTLAATDMYKGQVPGAGIVTAIGQVQGRDCMIVANDATVKGGTYFPMTIKKHLRAQEIAKECRLPCIYLVDSGGAHLPDQADVFPDREHFGRIFFNQANMSAEGIPQISVVMGSCTAGGAYIPSMSDESIIVKKQGTIFLAGPPLVKAATGETVSAEDLGGADLHCRKSGVTDHYALDDEHALQLAKNVVANLNWHNDQRIRVYASDIDEPLHDIDDLHGIVGANLQRPFDIREVIARIVDGSRFHEFKQLYGETLVCGFASVYGNPVGVLGNNGVLHSEAALKGAHFIQLCAARKIPLLFLQNITGFMVGPEAEAGGIAKNGAKLVTAVSCFKGPKVTVLVGGSFGAGNYGMCGRAYSPSFLYMWPNARISVMGGPQAATVLSLVAKEKANREKKEWTEEDEKKVKDPLEAKFDLEGRPYYSTARLWDDGILAPKDTRKVVGLSISAALNAPFRDSKFGIFRM-