Monarch geneset OGS2.0

DPOGS216075
TranscriptDPOGS216075-TA1773 bp
ProteinDPOGS216075-PA590 aa
Genomic positionDPSCF300067 + 487817-490298
RNAseq coverage125x (Rank: top 57%)
Annotation
HeliconiusHMEL0089400.069.95% 
BombyxBGIBMGA008873-TA0.071.96% 
DrosophilaCG16935-PA1e-8753.42% 
EBI UniRef50UniRef50_E3WS607e-9052.24%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WS60_ANODA
NCBI RefSeqXP_974428.12e-9756.21%PREDICTED: similar to zinc binding dehydrogenase [Tribolium castaneum]
NCBI nr blastpgi|3323748841e-9656.57%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323748842e-9456.57%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00082708.3e-124zinc ion binding
GO:00551148.3e-124oxidation-reduction process
GO:00164918.3e-124oxidoreductase activity
GO:00054889.2e-50binding
GO:00167471.8e-06transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathwaytca:6632795e-97 
 K07512 (MECR, NRBF1)maps-> Fatty acid elongation in mitochondria
InterPro domain[1-305] IPR0020858.3e-124Alcohol dehydrogenase superfamily, zinc-type
[106-279] IPR0160409.2e-50NAD(P)-binding domain
[1-129] IPR0110329.6e-33GroES-like
[137-254] IPR0131495.3e-14Alcohol dehydrogenase, C-terminal
[12-71] IPR0131548.6e-12Alcohol dehydrogenase GroES-like
[2-307] IPR0208431.8e-06Polyketide synthase, enoylreductase
Orthology groupMCL13838 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216075-TA
ATGAAAGAAGTTGAGGTGCCTCCACTTAAAAATCAAGAAGTTCTTGTAAGGATGTTAGCAGCACCGGTCAACCCGGCGGATATAAATACTATTCAAGGCAAATATCCAGTCAAAATTACACTGCCATCTATCCCAGGTAATGAAGGAGTTGGTGTTGTTGAGAGTGTGAGTGATGGAGTGAAGAACATTTGTCCTGGAGACAGAGTCATTATAGTTAAACCATTAAATGGTACTTGGAGGGATGTGGCCATTCTGAATCAGCAAGTATTGAGAGTTGTTCCTAAAGAACTTGGTCTAGTCGAAGCAGCTACGCTTACTGTTAATCCTTGTACTGCATACAGAATGCTATCAGACTTTAAAAATGTTAAAGATGGTCTTGTGGTTATACAAAATGGAGCAAATAGCGCTTGTGGTCAAATGGTAATACAAATCTGTAAGGCATGGGGAGTAAAAAATATAAATATAGTCAGAAACCGTCCCGAAATCAATGAATTGAAGGAGTATTTAAAATGTTTAGGTGCAACATACGTTCTCACTGAGGAGGAGTTGCGGACAACAACTATATTTAAAGAAAATAAAATTGATAGACCATCATTGGCATTGAATTGTGTTGGTGGGAAAAATGCTTTAGAAATGGTAAGACATTTGCAGAAATCTGCTGCGGTTGTAACTTATGGGGCAATGTCGAGAGAACCTGTGACTATTCCAAATGCCTCACTTATCTTCAAAAACATATCCTTCCATGGATTCTGGATGACAGCTTGGAATGAGAAAGCTCCTCAGGATAAGAAAGAGGATATGATCTGTGATATTATTAACCTAATGCTCGAAAAGAAACTTAATTGCCCCGTTCACAAGATGGTAAAATTTGATGACTATAAAACAGCTATAGGACAAACACTATCTACAAAAGGTTTTACTGGATGTAGGAACCCAATTCTTCCCCGTTTGCCAAGTATCCCTGGAGATGAAGGTGTTGGTGATGTCGTAGAAATTGGTGAACTAGTGTGTGCTGTAGAACCTGGTGAAAGAGTAGTGTTAACATCAAGAATGCTTGGTACTTGGTGCAAATACGGAATTTATAATGAACGAGATGTCCATGTTATTTCCCCAAATATTCCCCTCCCTGAAGCAGCTATGTTAACGATAGCACCTTGTACTGCCTACCGCATGCTCAAAGATTTCAGGAAGATGAAACCTGGAGATACTGTAATACAGAACGCTGCTAATAGCCCTTGTGGTCAATCAGTAATACAACTTTGTAAGGCGTGGGGCATAAATACATTGAATATAGTCGCTAGTCATTGTGGTTATGAATGTGTAAAAGAAAATCTTTTGAAAATAGGAGCTACAGCAGTTTATACTCTTGAAGAGGCGGAAGAACTTATGGTTTTTAACACATCTGTGACTAGACCTGTTTTAGCTTTAAACTGTTTAGGGGGTAGATTTGAGGATGTACTATTACGACTTCTCGACAAATCGGGTACAATTGTATACTATGGTTGTGCTTTTGATATACCGATTTGTAAACATATCCTACGTTCTGATGTATTTTTCAATAGATTTCATCTAGGTGCTTGGGACGCTTATGCAAGTGTTCTTGAAAAGGATGTTATGATGAACAGAATTGTTAATTTAATTGTGCAAGGGAAATTTAAAGCTCCTTTTTACAAGCCTTTAGAAATTAAAAATTATATATACGCATTACAAAACACTGTTCATTGTGAAGCATTTGCGACAACTAATTTTGTCTTTGACTTCACTTTAACATAA

Protein sequence:

>DPOGS216075-PA
MKEVEVPPLKNQEVLVRMLAAPVNPADINTIQGKYPVKITLPSIPGNEGVGVVESVSDGVKNICPGDRVIIVKPLNGTWRDVAILNQQVLRVVPKELGLVEAATLTVNPCTAYRMLSDFKNVKDGLVVIQNGANSACGQMVIQICKAWGVKNINIVRNRPEINELKEYLKCLGATYVLTEEELRTTTIFKENKIDRPSLALNCVGGKNALEMVRHLQKSAAVVTYGAMSREPVTIPNASLIFKNISFHGFWMTAWNEKAPQDKKEDMICDIINLMLEKKLNCPVHKMVKFDDYKTAIGQTLSTKGFTGCRNPILPRLPSIPGDEGVGDVVEIGELVCAVEPGERVVLTSRMLGTWCKYGIYNERDVHVISPNIPLPEAAMLTIAPCTAYRMLKDFRKMKPGDTVIQNAANSPCGQSVIQLCKAWGINTLNIVASHCGYECVKENLLKIGATAVYTLEEAEELMVFNTSVTRPVLALNCLGGRFEDVLLRLLDKSGTIVYYGCAFDIPICKHILRSDVFFNRFHLGAWDAYASVLEKDVMMNRIVNLIVQGKFKAPFYKPLEIKNYIYALQNTVHCEAFATTNFVFDFTLT-