Monarch geneset OGS2.0

DPOGS209633
Transcript        DPOGS209633-TA  1020 bp
Protein           DPOGS209633-PA  339 aa
Genomic position  DPSCF300015 + 893120-895051
RNAseq coverage   20x (Rank: top 79%)
Annotation
Heliconius      HMEL017034        6e-132  67.75%
Bombyx          BGIBMGA006698-TA  4e-86   76.72%
Drosophila      CG18327-PA        3e-63   40.55%
EBI UniRef50    UniRef50_E0VZ28   1e-66   40.00%  Mitochondrial 2-oxoglutarate/malate carrier protein, putative n=3 Tax=Pancrustacea RepID=E0VZ28_PEDHC
NCBI RefSeq     XP_002431372.1    2e-67   40.00%  mitochondrial 2-oxoglutarate/malate carrier protein, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242021883      5e-66   40.00%  mitochondrial 2-oxoglutarate/malate carrier protein, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|242021883      1e-63   40.00%  mitochondrial 2-oxoglutarate/malate carrier protein, putative [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain  [2-239]    IPR023395  6.6e-46  Mitochondrial carrier domain
                 [100-195]  IPR018108  2.2e-18  Mitochondrial substrate/solute carrier
Orthology group  MCL17617  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209633-TA
ATGGACTTTGTAATCGGTGGCTTAGCCGGTGCAGGCGCTACCATATTTACGAATCCCATGGACGTGGTAAAAACCCGGCTACAGTTACAAGGGGAGCTGCGCGCACGCACCGAGCACACCGCAAGATATAGGGGAATTTTTCATGGGGTATACGTCATAGCTAAGACAGACGGAGCATTAGCACTCCAGAAAGGTTTAGTACCAGCTATGGTTCTCGGTTTCTGTATGAATTCGGTGAGGTTGGGAATGTATCACGTGGCTGACGTTCAGGGTTGGACACGAACGACAGATGGTGATATCAGCATTCACAAGACCATGTTTTGGTCGAGCGCGAGCGGCGTTATGAGTGGACTAGCCGCCAACCCTGCATCTGTCGTGAAGACAAGAATGCAGGCGGCAGCTCATCCTAGTATCGCTGTCGGAAGACAGTATGTCTATAATGGTATGATAGACGGTTGCGTTAAAATTTACAAAATGGAGGGAATAAAAGGTTTCTTTGCGGGAGTGAACGCGACATGCACTAGATTGGCCGTTGGCAGTGCAGCCCAACTCACAACATTTTCAACTGCAAAAGAAACTTTGCTGTATTATGGCATATGCGAGAAAACACCTTTGGGACTTGCATTTGCGGCAAGCTGTTTAAGCGGCCTTATGGTAGCTCTAGCGATCTGCCCACTTGATGTCGTAGCTGTACGACTATACAATCAGGGTTACATGGAGAATTCTCCAGCAGGCCTCGCGTTCACCTCCAGCATAGCTTGTGGGATCGTATGTGTTTTGTTGGAAACGCCGCTGGATGTAGTCAATACGCGGCTTTATAATCAGGGGCCTGCGAAACAAGGAAAACTCCTCTACAACGGAGTTCTTGACTGTCTCAGAAAAATTTATATGACTGAAGGACTTCACGGCCTCTATAAAGGCATAGGACCACTCTACCTCAGAATAGCACCTCATACTACCCTATCTCTCGTAATATGGGACATGCTGAATATAATAGTCATGAATAAAAGGAAATCATAA

Protein sequence:

>DPOGS209633-PA
MDFVIGGLAGAGATIFTNPMDVVKTRLQLQGELRARTEHTARYRGIFHGVYVIAKTDGALALQKGLVPAMVLGFCMNSVRLGMYHVADVQGWTRTTDGDISIHKTMFWSSASGVMSGLAANPASVVKTRMQAAAHPSIAVGRQYVYNGMIDGCVKIYKMEGIKGFFAGVNATCTRLAVGSAAQLTTFSTAKETLLYYGICEKTPLGLAFAASCLSGLMVALAICPLDVVAVRLYNQGYMENSPAGLAFTSSIACGIVCVLLETPLDVVNTRLYNQGPAKQGKLLYNGVLDCLRKIYMTEGLHGLYKGIGPLYLRIAPHTTLSLVIWDMLNIIVMNKRKS-
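
The transcript and protein records above can be cross-checked against each other: 339 residues plus one stop codon account for 3 x (339 + 1) = 1020 bp. The sketch below is one possible way to run that check and to translate a coding sequence with the standard genetic code. It assumes the standard code applies to this nuclear gene and that the trailing "-" in the protein record marks the stop codon; the names CODON_TABLE and translate are hypothetical helpers, not part of the database.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to match the
    "-" that ends the protein record). Raises ValueError if the length
    is not a multiple of three.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# Length bookkeeping from the record: 339 aa + 1 stop codon = 1020 bp.
assert 3 * (339 + 1) == 1020

# First seven codons and last seven codons of DPOGS209633-TA.
print(translate("ATGGACTTTGTAATCGGTGGC"))   # prints MDFVIGG
print(translate("ATGAATAAAAGGAAATCATAA"))   # prints MNKRKS-
```

Passing the full DPOGS209633-TA sequence to translate and comparing the result with the DPOGS209633-PA record would extend the same check to the whole gene.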