Monarch geneset OGS2.0

DPOGS201874
TranscriptDPOGS201874-TA1293 bp
ProteinDPOGS201874-PA430 aa
Genomic positionDPSCF300191 + 182273-197147
RNAseq coverage276x (Rank: top 39%)
Annotation
HeliconiusHMEL0128464e-5070.50% 
BombyxBGIBMGA006048-TA4e-8369.16% 
DrosophilaCG13284-PB4e-6649.04% 
EBI UniRef50UniRef50_E0V9512e-6950.00%Steroid dehydrogenase, putative n=8 Tax=Neoptera RepID=E0V951_PEDHC
NCBI RefSeqXP_001603427.18e-8156.65%PREDICTED: similar to ENSANGP00000013086 [Nasonia vitripennis]
NCBI nr blastpgi|910836894e-7955.29%PREDICTED: similar to steroid dehydrogenase isoform 1 [Tribolium castaneum]
NCBI nr blastxgi|2700078851e-7555.51%hypothetical protein TcasGA2_TC014627 [Tribolium castaneum]
Group
Gene OntologyGO:00054888.3e-50binding
GO:00081527.4e-24metabolic process
GO:00164917.4e-24oxidoreductase activity
KEGG pathwaymcc:7149625e-60 
 K10251 (KAR)maps-> Biosynthesis of unsaturated fatty acids
InterPro domain[170-355] IPR0160408.3e-50NAD(P)-binding domain
[169-335] IPR0021987.4e-24Short-chain dehydrogenase/reductase SDR
[168-185] IPR0023478.6e-17Glucose/ribitol dehydrogenase
Orthology groupMCL10876 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201874-TA
ATGTTAGGATTTATTTGTTTAGCTGTGATAGGCGCGATAACGGTGGCCGTGTTTTTGATTGATTCCCTGTGGAGCGTTTTGGAACTGATAACGTCATATTTAACACCTTATTTCATACCCACAGAAGTGTTACCCTTGTCAAAGAGGTTCGGACCTTGGGCTGCCGTGACCGGTTCCACGGATGGCATCGGTAAGGAGTACGCTCTCGAGCTGGCTCGGCTGGGGATGAACGTGGTGCTCATCAGTCGCAGCGAAGACAAGCTGAGAACAGTCTCCAGAGAGATCGAGAAGCTGCACGGGGTGAAAACCAAAATCATCGTAGCAGATTTCAGCAAAGGAACTGAGATTTATCAGAACATTGAGAATGGACTCAAGGATGTGCCCTTGGGTATCTTGGAGAAGCTGCACGGGGTGAAAACCAAAATCATCGTAGCGGATTTCAGCAAAGGCACTGAGATTTATCAGAACATTGAGAATGGACTCAAGGATGTGCCCTTGGGTATCTTGGCCGTGACCGGTTCCACGGATGGCATCGGTAAGGAGTACGCTCTCGAGCTGGCTCGGCTGGGGATGAACGTGGTGCTCATCAGTCGCAGCGAAGACAAGCTGAGAACAGTCTCCAGAGAGATCGAGAAGCTGCACGGGGTGAAAACCAAAATCATCGTAGCAGATTTCAGCAAAGGAACTGAGATTTATCAGAACATTGAGAATGGACTCAAGGATGTGCCCTTGGGTATCTTGGTGAATAACGTCGGAGTTCAATACGAGTATCCGATGCCGCTGGTGGAGTTGCCTGTGAGTAAAGCCTGGGAGCTGATCAGTGTGAACGTGGTCGCGGTGACAACCCTGACCCGCATGGTGCTGCCCGGGATGTTGGCCCGGGGGCGGGGGGCCGTCGTCAACGTGTCCTCGGGCTCCGAGCTGCAGCCCCTGCCGCTTATGGCTGTGTACGCTGCCACTAAGTCGTACGTGCGCAGCCTGACGCTGGCGCTCCGTGCGGAGGTGTCTCCGACTGTGACGGTGCAGCACGTGTCTCCGCTGTTCGTGTCCACTAAGATGAACACCTTCTCCCCCACACTCCTGGCCGGCAACCCGCTGGTGCCCGACGCGAGGACCTACGCCAGGCACGCCGTCCGCACGCTGGGGAGAGTCACCGCTACGTCCGGCTATTGGGTCCATGGCGTTCAGAGTTTCTTCATCAAACTAGCCCCGGAATGGGTCCGGATAAAGGTCGGCGCTCAAATGAACAGAGAATTCAGAGAGGAACACATGAGAGCGATCAAGAGACAATGA

Protein sequence:

>DPOGS201874-PA
MLGFICLAVIGAITVAVFLIDSLWSVLELITSYLTPYFIPTEVLPLSKRFGPWAAVTGSTDGIGKEYALELARLGMNVVLISRSEDKLRTVSREIEKLHGVKTKIIVADFSKGTEIYQNIENGLKDVPLGILEKLHGVKTKIIVADFSKGTEIYQNIENGLKDVPLGILAVTGSTDGIGKEYALELARLGMNVVLISRSEDKLRTVSREIEKLHGVKTKIIVADFSKGTEIYQNIENGLKDVPLGILVNNVGVQYEYPMPLVELPVSKAWELISVNVVAVTTLTRMVLPGMLARGRGAVVNVSSGSELQPLPLMAVYAATKSYVRSLTLALRAEVSPTVTVQHVSPLFVSTKMNTFSPTLLAGNPLVPDARTYARHAVRTLGRVTATSGYWVHGVQSFFIKLAPEWVRIKVGAQMNREFREEHMRAIKRQ-