Monarch geneset OGS2.0

DPOGS210806
TranscriptDPOGS210806-TA3168 bp
ProteinDPOGS210806-PA1055 aa
Genomic positionDPSCF300027 - 783057-795752
RNAseq coverage7133x (Rank: top 2%)
Annotation
HeliconiusHMEL0078300.092.78% 
BombyxBGIBMGA007121-TA0.092.38% 
DrosophilaAcon-PB0.079.71% 
EBI UniRef50UniRef50_Q997980.075.09%Aconitate hydratase, mitochondrial n=92 Tax=cellular organisms RepID=ACON_HUMAN
NCBI RefSeqXP_391994.10.082.70%PREDICTED: similar to Aconitase CG9244-PB [Apis mellifera]
NCBI nr blastpgi|289717120.090.07%hypothetical protein [Antheraea yamamai]
NCBI nr blastxgi|289717120.090.07%hypothetical protein [Antheraea yamamai]
Group
Gene OntologyGO:00060991.5e-293tricarboxylic acid cycle
GO:00039941.5e-293aconitate hydratase activity
GO:00515391.5e-2934 iron, 4 sulfur cluster binding
GO:00081521.4e-180metabolic process
KEGG pathwayame:4084460.0 
 K01681 (ACO, acnA)maps-> Citrate cycle (TCA cycle)
    Reductive carboxylate cycle (CO2 fixation)
    Glyoxylate and dicarboxylate metabolism
InterPro domain[20-1052] IPR0062480Aconitase, mitochondrial-like
[20-1052] IPR0159370Aconitase/isopropylmalate dehydratase
[70-509] IPR0010301.4e-180Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha
[834-1050] IPR0159282.4e-93Aconitase/3-isopropylmalate dehydratase, swivel
[46-235] IPR0159314.5e-67Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha, subdomain 1/3
[857-984] IPR0005733.5e-46Aconitase A/isopropylmalate dehydratase small subunit, swivel
[236-348] IPR0159324.8e-42Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha, subdomain 2
Orthology groupMCL10855 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210806-TA
ATGGCTCACTGTACGAGAGTTTTACATGGCCAAGGTGGCAGGACGAGGCTTGCGCTCGTGGAAATTCAGAAACGATGCTTCAGTGTATCACCATTAACCGCAGCAGCGGCCCAGGTCGCGATGTCAAAATTCGACAAAGCTCCTCTCCCGTACCAAAAGTTAACTCAGAATATCGAAATTGTGAAGAAAAGGTTGGGCCGTGATTTGACCTTGTCCGAGAAGGTCTTGTACTCTCATCTGGACGACCCTAAAGGACAGGAAATCGAACGTGGTACCAGCTATCTCCGCCTTCGTCCTGACCGTGTTGCGATGCAAGATGCCACAGCGCAGATGGCCATGCTACAGTTCATATCTTCAGGACTCCCCAGAGTGGCTGTTCCATCTACCATCCACTGCGATCACTTGATTGAGGCCCAAATCGGCGGAGATAAAGATTTAGCAAGAGCCAAGGATATTAACAAAGAGGTATACAAGTTCCTTGAAACAGCCGGTGCTAAGTACGGAGTGGGCTTCTGGAAACCTGGTTCTGGTATCATTCACCAGATCATCTTGGAGAATTACGCCTTTCCTGGTTTACTGATGATCGGTACCGATTCTCACACTCCTAACGGCGGTGGTCTTGGTGGACTCTGCATTGGTGTTGGCGGAGCTGATGCTGTAGATGTAATGGCCGACATCCCCTGGGAGCTGAAGTGTCCCAAGGTCATCGGTGTCAAACTCACAGGCAAGCTGAAGGGCTGGACGAGCCCTAAGGATGTGATCCTGAAGGTAGCCGGCATCCTGACAGTGAAGGGAGGCACGGGCGCCATCGTGGAGTACCACGGACCTGGCGTCGACTCCATCTCGTGTACCGGCATGGCAACCATCTGTAACATGGGGGCCGAGATCGGCGCTACCACCAGTGTGTTCCCTTACAACGCCCGTATGGAGGCCTACCTTAAATCCACCGGCCGTCACGACATCGCCTCCGCCGCTAACAGCTACAAGAACCTCCTCACACCTGATCCCAAGGCGCCTTACGACCAGTTAATCGAAATCGACCTGTCTACATTGGAGCCCCACGTGAACGGTCCCTTCACGCCCGACCTCGCCAACCCGATCTCCAAGTTGGGAGAGATCGCGAAGAAGAACGACTGGCCTATCGACATCAAGGTCGGCCTCATTGGCTCGTGCACCAACTCCTCGTACGAAGACATGGGACGCTGTGCCAGTATTGTCAAGGAGGCCCTGAGTCACGGCGTCAAATCCAAGATACCGTTCAACGTGACCCCCGGCTCGGAACAAGTGCGCGCCACCATAGAGAGAGACGGCATCGCGCAGACACTCAGGGACTTCGGAGGAACCGTGTTGGCGAACGCCTGCGGTCCGTGCATCGGTCAATGGGACCGCAAGGACGTGAAGAAGGGGGATAAGAACACCATAGTGACGTCATACAACAGAAACTTCACCGGCAGAAACGACGCCAACCCCGCCACCCACTGCTTCGTCACCAGCCCCGAGCTGGTCACGGCTCTCTCACTGGCCGGCAAGCTGAAGGGCTGGACGAGCCCTAAGGATGTGATCCTGAAGGTAGCCGGCATCCTGACAGTGAAGGGAGGCACGGGCGCCATCGTGGAGTACCACGGACCTGGCGTCGACTCCATCTCGTGTACCGGCATGGCAACCATCTGTAACATGGGGGCCGAGATCGGCGCTACCACCAGTGTGTTCCCTTACAACGCCCGTATGGAGGCCTACCTTAAATCCACCGGCCGCCACGACATCGCCTCCGCCGCTAACAGCTACAAGAACCTCCTCACGCCTGATCCCAAGGCGCCTTACGACCAGTTAATCGAAATCGACCTGTCTACATTGGAGCCCCACGTGAACGGTCCCTTCACGCCCGACCTCGCCAACCCGATCTCCAAGTTGGGAGAGATCGCGAAGAAGAACGACTGGCCTATCGACATCAAGGTCGGCCTCATTGGCTCGTGCACCAACTCCTCGTACGAAGACATGGGACGCTGTGCCAGTATTGTCAAGGAGGCTTTGAGTCACGGCGTCAAATCCAAGATACCGTTCAACGTGACCCCCGGCTCGGAACAAGTGCGCGCCACCATAGAGAGAGACGGCATCGCGCAGACACTCAGGGACTTCGGAGGAACCGTGTTGGCGAACGCCTGCGGTCCGTGCATCGGTCAATGGGACCGCAAGGACGTGAAGAAGGGGGATAAGAACACCATAGTGACGTCATACAACAGAAACTTCACCGGCAGAAACGACGCCAACCCCGCCACCCACTGCTTCGTCACCAGCCCCGAGCTGGTCACGGCTCTCTCACTGGCCGGCCGTCTTGATTTCAATCCCCTGACCGACTCCCTGAAGGGCAAGGATGGCAAGGAGTTCAAACTATCAGATCCCTTCGCTGATGAGTTACCAGCTAAAGGCTTCGACCCCGGCCAGGACACGTACGAACACCCGCCAGCCGATGGATCCAAGGTCAAAGTGGACGTGTCTCCTACATCAGACCGCCTGCAGTTGTTGGAACCTTTCGACAAGTGGGATGGCAAGGACCTCAGTGACCTGACCATACTCATCAAGGTGAAGGGGAAATGCACCACGGACCACATCTCCGCCGCCGGGCCCTGGCTCAAGTACAGAGGTCATCTGGACAACATCTCCAATAACATGTTCATAACCGCAACGAACGCTGAGAACGGCGAGCTGAACAAGGTCCGCAACCTGCAGACGGGCGAGTGGGGGCCTGTCCCGGCGACCGCCCGGGCTTACAAGGCGGCCGGCGTGAGGTGGTGTGTGGTGGGAGACGAGAACTACGGAGAGGGCTCCAGCAGGGAGCACGCCGCCCTGGAGCCGAGGCACCTCGGGGGACGAGCCATCATAGTCAAATCCTTCGCCAGAATCCACGAGACCAACTTGAAGAAGCAGGGTCTACTGCCGCTGACCTTCGCCAACCCCTCGGACTACGACAAGATCCAGCCATCGGACAAGATCTCACTGCTGGGACTCAACGACCTGGCGCCTGGAAAGCAAGTGGAATGTGAAATCAAACACAAGGACGGTAAATCCGAGCGCATCAAACTGAACCACTCTTTGAACGAGCAACAGATCTCGTGGTTCAAGGCTGGGTCAGCTCTCAACAGGATGAAGGAGATCGCCGCTGGGAAGTAA

Protein sequence:

>DPOGS210806-PA
MAHCTRVLHGQGGRTRLALVEIQKRCFSVSPLTAAAAQVAMSKFDKAPLPYQKLTQNIEIVKKRLGRDLTLSEKVLYSHLDDPKGQEIERGTSYLRLRPDRVAMQDATAQMAMLQFISSGLPRVAVPSTIHCDHLIEAQIGGDKDLARAKDINKEVYKFLETAGAKYGVGFWKPGSGIIHQIILENYAFPGLLMIGTDSHTPNGGGLGGLCIGVGGADAVDVMADIPWELKCPKVIGVKLTGKLKGWTSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSISCTGMATICNMGAEIGATTSVFPYNARMEAYLKSTGRHDIASAANSYKNLLTPDPKAPYDQLIEIDLSTLEPHVNGPFTPDLANPISKLGEIAKKNDWPIDIKVGLIGSCTNSSYEDMGRCASIVKEALSHGVKSKIPFNVTPGSEQVRATIERDGIAQTLRDFGGTVLANACGPCIGQWDRKDVKKGDKNTIVTSYNRNFTGRNDANPATHCFVTSPELVTALSLAGKLKGWTSPKDVILKVAGILTVKGGTGAIVEYHGPGVDSISCTGMATICNMGAEIGATTSVFPYNARMEAYLKSTGRHDIASAANSYKNLLTPDPKAPYDQLIEIDLSTLEPHVNGPFTPDLANPISKLGEIAKKNDWPIDIKVGLIGSCTNSSYEDMGRCASIVKEALSHGVKSKIPFNVTPGSEQVRATIERDGIAQTLRDFGGTVLANACGPCIGQWDRKDVKKGDKNTIVTSYNRNFTGRNDANPATHCFVTSPELVTALSLAGRLDFNPLTDSLKGKDGKEFKLSDPFADELPAKGFDPGQDTYEHPPADGSKVKVDVSPTSDRLQLLEPFDKWDGKDLSDLTILIKVKGKCTTDHISAAGPWLKYRGHLDNISNNMFITATNAENGELNKVRNLQTGEWGPVPATARAYKAAGVRWCVVGDENYGEGSSREHAALEPRHLGGRAIIVKSFARIHETNLKKQGLLPLTFANPSDYDKIQPSDKISLLGLNDLAPGKQVECEIKHKDGKSERIKLNHSLNEQQISWFKAGSALNRMKEIAAGK-