Monarch geneset OGS2.0

DPOGS200503
Transcript: DPOGS200503-TA (1908 bp)
Protein: DPOGS200503-PA (635 aa)
Genomic position: DPSCF300450, 94454-114767
RNAseq coverage: 1214x (Rank: top 10%)
Annotation
Heliconius      HMEL010741        0.0  90.28%
Bombyx          BGIBMGA001725-TA  0.0  86.80%
Drosophila      Irp-1B-PA         0.0  74.49%
EBI UniRef50    UniRef50_P21399   0.0  68.46%  Cytoplasmic aconitate hydratase n=283 Tax=root RepID=ACOC_HUMAN
NCBI RefSeq     XP_001843386.1    0.0  77.59%  iron-responsive element-binding protein 1 [Culex quinquefasciatus]
NCBI nr blastp  gi|15418786       0.0  89.58%  iron regulatory protein 1 [Manduca sexta]
NCBI nr blastx  gi|15418786       0.0  89.58%  iron regulatory protein 1 [Manduca sexta]
Group
Gene Ontology   GO:0008152  0  metabolic process
KEGG pathway    cqu:CpipJ_CPIJ002118  0.0
                K01681 (ACO, acnA) maps->
                    Citrate cycle (TCA cycle)
                    Reductive carboxylate cycle (CO2 fixation)
                    Glyoxylate and dicarboxylate metabolism
InterPro domain
[4-596]    IPR015934  0         Aconitase/Iron regulatory protein 2/2-methylisocitrate dehydratase
[4-596]    IPR015937  0         Aconitase/isopropylmalate dehydratase
[21-595]   IPR001030  1.3e-212  Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha
[45-261]   IPR015931  5e-83     Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha, subdomain 1/3
[262-382]  IPR015932  1.1e-32   Aconitase/3-isopropylmalate dehydratase large subunit, alpha/beta/alpha, subdomain 2
Orthology group: MCL10878 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200503-TA
ATGTTAAAAAATGACATAAAATCATATCTGGAATATAAAATTTCACACGTTCTTTCAGCGCCAAATCCATACAGCGGGCTCCTGAAGAAATTGGAAGTTAATAATCAGAGCTTCAGTTACTTTGACATAACGCAGCTCGGCGCTAAATATGATCGTCTCCCATTTAGCGTGCGCGTTCTCCTCGAGTCCTGCGTCCGTAACTGCGACAATTTCCAGGTGTTGGAGAAGGATGTGAACAATGTGCTGGATTGGGAGAAGAACCAGGCCGTCGAGGGTGGAGTGGAAATAGCTTTCAAACCAGCTCGGGTCATTTTACAGGATCTAACAGGTGTACCAGCTGTTGTGGACTTTGCAGCGATGCGTCACGCCGTCAAACAGCTGGGCGGTGATCCTGACAGAATAAACCCCATATGTCCAGCTGACCTGGTCATAGATCATTCCGTCCAGGTGGATTTTGCAAGAACCCCCGACGCTTTAAACAAAAACCAAGAGTTGGAGTTTGAACGCAACAAGGAGAGATTCCAGTTTTTGAAGTGGGGCGCTCAAGCGTTTGACAACATGCTGATAGTTCCACCTGGATCCGGTATAGTTCACCAAGTCAACTTGGAGTATCTAGCCCGAGTGGTGTTCACCAAAGATCTTCTGTATCCCGACTCGGTAGTGGGTACCGATTCTCACACCACCATGATCAATGGCCTGGGGGTCGTTGGCTGGGGTGTTGGAGGTATTGAAGCAGAGGCGGTGATGTTGGGTCAGGCTATCAGTATGTTGTTGCCAAAAGTGGTCGGTTACAAGTTGGTTGGTGAATTGAACCCACTAGCGACATCTACCGACTTAGTGTTAACTATTACTAAGCACCTCCGTTCCCTGGGCGTGGTTGGTAAATTCGTGGAGTTCTTCGGCCCGGGCGTGTGTGCTCTCAGTATAGCTGACCGCGCCACCGTAGCCAACATGTGCCCCGAGTTTGGTGCCACTGTCGCTCACTTCCCTGTGGACGAACGCTCGCTGGACTACCTCAGACAGACAAATCGTTCGGATGATAAGATCAAGAAAATTGAGGAATACTTGAAGGCAACTAAACAGTTTAGAGACTACAGCAATCCCGAGCAGGACCCCGTGTTCTCAGAGGTTGTGGAGTTGGATTTGTCAACAGTGGTGACATCTGTGAGCGGGCCCAAGAGACCTCAAGACAGAGTCGCTGTGAAAGATATGAAGGAAGACTTCAGGGCCTGTCTTAATAATAAGGTGGGCTTCAAGGGGTACGGGCTTACTCCCGCGCAACTCACCTCGTCGGGCAGTTTCTCGTACAGCGACGGGAACACTTACTCCATCACACACGGCTCGGTCATTATAGCCGCGATCACGTCCTGCACCAACACCTCCAACCCGAGCGTCATGTTGGGTGCCGGTTTGCTAGCAAAGAAGGCTGTGGAGAACGGTTTGTCCGTCCTTCCATACATCAAGACCTCGTTGTCGCCAGGGTCTGGTGTAGTCACGTATTATCTGAAGGAATCCGGCGTGGTTCCGTACCTGGAAAAGTTGGGTTTCGACATCGTGGGCTACGGCTGCATGACGTGCATCGGGAACTCGGGACCCATAGACGACAACATCGCCAACACCATAGAGAAGAACGAGTTGGTTTGCTGCGGCGTGCTCTCCGGCAACAGAAACTTTGAAGGTCGGATCCATCCAAACACGAGAGCTAATTACCTAGCGAGCCCCCTGCTGGTCATCGCATATGCTTTGGCCGGTACTGTTGACATCGACTTCGAGAAGCAACCGCTGGGTGAGTGTCGCATAGAAACGATCCGTATCCTCATTAGCTCACATCCAACACAAAGGCTTTATTATGGTTTTTTTAATTCTGTGACGTCATCGATTTTTTTTTTCAGGTCTATGAGAAGATAG

Protein sequence:

>DPOGS200503-PA
MLKNDIKSYLEYKISHVLSAPNPYSGLLKKLEVNNQSFSYFDITQLGAKYDRLPFSVRVLLESCVRNCDNFQVLEKDVNNVLDWEKNQAVEGGVEIAFKPARVILQDLTGVPAVVDFAAMRHAVKQLGGDPDRINPICPADLVIDHSVQVDFARTPDALNKNQELEFERNKERFQFLKWGAQAFDNMLIVPPGSGIVHQVNLEYLARVVFTKDLLYPDSVVGTDSHTTMINGLGVVGWGVGGIEAEAVMLGQAISMLLPKVVGYKLVGELNPLATSTDLVLTITKHLRSLGVVGKFVEFFGPGVCALSIADRATVANMCPEFGATVAHFPVDERSLDYLRQTNRSDDKIKKIEEYLKATKQFRDYSNPEQDPVFSEVVELDLSTVVTSVSGPKRPQDRVAVKDMKEDFRACLNNKVGFKGYGLTPAQLTSSGSFSYSDGNTYSITHGSVIIAAITSCTNTSNPSVMLGAGLLAKKAVENGLSVLPYIKTSLSPGSGVVTYYLKESGVVPYLEKLGFDIVGYGCMTCIGNSGPIDDNIANTIEKNELVCCGVLSGNRNFEGRIHPNTRANYLASPLLVIAYALAGTVDIDFEKQPLGECRIETIRILISSHPTQRLYYGFFNSVTSSIFFFRSMRR-
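
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, that assumes the transcript is a pure coding sequence (no UTRs) translated with the standard genetic code. The function names `translate` and `lengths_consistent` are hypothetical helpers introduced here for illustration. Note that this sketch writes a stop codon as `*`, whereas the record above ends the protein with `-`.

```python
# Standard genetic code (NCBI translation table 1), with codon positions
# ordered T, C, A, G. Using the standard code is an assumption of this sketch.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'.

    Trailing bases that do not fill a codon are ignored. Any character
    other than A, C, G, T raises ValueError.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        first, second, third = (BASES.index(base) for base in cds[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the codons for the protein plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First five codons and last three codons of DPOGS200503-TA, copied from the record.
print(translate("ATGTTAAAAAATGAC"))   # MLKND
print(translate("AGAAGATAG"))         # RR*
print(lengths_consistent(1908, 635))  # True
```

Under these assumptions the listed lengths agree: 635 codons plus one stop codon give 636 x 3 = 1908 bp, and the first and last translated codons match the start (`MLKND`) and end (`RR`) of DPOGS200503-PA.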