Monarch geneset OGS2.0

DPOGS207507
Transcript: DPOGS207507-TA (2664 bp)
Protein: DPOGS207507-PA (887 aa)
Genomic position: DPSCF300177 - 370847-385773
RNAseq coverage: 1857x (Rank: top 7%)
Annotation
Source          Hit                E-value  Identity  Description
Heliconius      HMEL017957         4e-78    81.21%
Bombyx          BGIBMGA001929-TA   0.0      82.08%
Drosophila      CG3415-PA          4e-137   55.78%
EBI UniRef50    UniRef50_Q9VXJ0    6e-135   55.78%    CG3415 n=34 Tax=Neoptera RepID=Q9VXJ0_DROME
NCBI RefSeq     XP_974784.1        1e-147   60.00%    PREDICTED: similar to estradiol 17 beta-dehydrogenase [Tribolium castaneum]
NCBI nr blastp  gi|91085817        3e-146   60.00%    PREDICTED: similar to estradiol 17 beta-dehydrogenase [Tribolium castaneum]
NCBI nr blastx  gi|91085817        3e-141   60.00%    PREDICTED: similar to estradiol 17 beta-dehydrogenase [Tribolium castaneum]
Group
Gene Ontology:
  GO:0005488   3.9e-56   binding
  GO:0008152   1.7e-30   metabolic process
  GO:0016491   1.7e-30   oxidoreductase activity
  GO:0032934   1.1e-27   sterol binding
KEGG pathway: tca:663655   4e-147
  K12405 (HSD17B4) maps to:
    Peroxisome
    Primary bile acid biosynthesis
InterPro domain:
  [5-235]     IPR016040   3.9e-56   NAD(P)-binding domain
  [10-180]    IPR002198   1.7e-30   Short-chain dehydrogenase/reductase SDR
  [629-744]   IPR002539   5.4e-30   MaoC-like dehydratase
  [769-885]   IPR003033   1.1e-27   SCP2 sterol-binding domain
  [10-27]     IPR002347   9.8e-24   Glucose/ribitol dehydrogenase
Orthology group: MCL14320 (Single-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207507-TA
ATGGATCAATTAAATTTCGCTGGCCGCGTAGCTGTCGTAACAGGAGCCGGTGGTGGCCTAGGAAAGGCCTACGCTTTACTTCTTGGCTCCAGGGGTGCTAAAGTGGTGGTAAACGATCTCGGCAGTGCTCGGGATGGCGTTGGAAAATCAAAATTCGCAGATCTTGTTGTACAGGAAATTAAGGACAAAGGTGGTATCGCTGTCGCCGATTACAACTCGGTGGTTGAAGGAGAGAAGATTATAAAAACAGCTCTCGATAACTTCGGAAGGGTTGACATTCTTATCAACAATGCCGGTATTTTACGTGATAAGAGCTTTCAAAAAATGTCCGAACAGGATTGGGACCTGATCCAAGCTGTGCATCTGAAAGGCGCTTACAAAACAACTCAAGCTGCGTGGGAGACTTTCAAGAAACAGAAATATGGTCGCATCATCATGACGACCAGTAATGCTGGTTTGTTCGGGAACTTTGGCCAGGCCAATTACAGCGCCGCAAAGATGGGTTTAGTCGGTCTGGCCAGTACATTGGCCATTGAGGGCGCTAAATACAACATTAAAGTGAATACTATAGTACCGACAGCAGCGTCCAGACTCACAGAAGACATCCTGCCGCCAGAGATGTTCCAAGCGATGAAACCTGATCTCATCGCACCCGTTGTAGCCTACATGGTGCATGAGTCCTTCACTGACAGCGGTCTCATCATTGACTCGACATTGGGTTTCGCGACCAAGACTCATCTTGTACGCTCACAAGGAGCACCCTTGAGGAAGAAACCGTCGGATCCGGTGACAATAGAGTCGGTTAGAGACAATTGGTCCCAGGTCGTGAACATGGAGAACGCTCAGCGTATAGAGAAGATAGCTGAAGTTACGGTCGACCTCGTGGAGAAATTACAGGACTTTGAGGAGCGTTGTAAGTTGGATGGTCAGAATGAGAGTTACTGGGGGAAGTACAAGTATAACTCCAAAGACCTGGTCTGCTACGCCCTAGGAGTTGGTGCGTCGGTGGTGAACCCTTCAGACCTCAAGTTCCTATACGAGAGCCACGAGAGTTTCTCAGCTCTACCCACATTCTTCATACTCCCCGGTATGTTGATGGAATCACCTGTAGTGGGTAAAGCGATGCCTCCAGGAAAATATGCTGACTTTACAAATATTCTCCATGGCGAGCAGTACATCGAATTTCTGTCAGACCTGCCCGGCACTGAGGGAGAGTTCACTGTACGGAACTATGTAGTGGACTTACTGGACAAGGGATCCAGCGCCGTGGCTGCCTACATGGTGCATGAGTCCTTCACTGACAGCGGTCTCATCATTGACTCGACATTGGGTTTCGCGACCAAGACTCATCTTGTACGCTCACAAGGAGCACCCTTGAGGAAGAAACCGTCGGATCCGGTGACAATAGAGTCGGTTAGAGACAATTGGTCCCAGGTCGTGAACATGGAGAACGCTCAGCGTATAGAGAAGATAGCTGAAGTTACGGTCGACCTCGTGGAGAAATTACAGGACTTTGAGGAGCGTTGTAAGTTGGATGGTCAGAATGAGAGTTACTGGGGGAAGTACAAGTATAACTCCAAAGACCTGGTCTGCTACGCCCTAGGAGTTGGTGCGTCGGTGGTGAACCCTTCAGACCTCAAGTTCCTATACGAGAGCCACGAGAGTTTCTCAGCTCTACCCACATTCTTCATACTCCCCGGTATGTTGATGGAATCACCTGTAGTGGGTAAAGCGATGCCTCCAGGAAAATACGCTGACTTTACAAATGTACGTGAAATATACCAAAACAAAGAGCTGGTTATCAGAACTCAACAGCACATTTTCGTCCTGGGTCAAGGTGGGTTTGGTGGACCGAGGAACAGTAAACAGGCTATAGCAGTAGAAGCTGTTCCAAAGAGATCACCTGATGCTGTACTGGAACAAAGGACAGCTGAAGATCAAGCTGCTTTATACAGATTGTCAGGAGATTTGAATCCACTACACATTGATCCTAATGTGGC
CACAGCCAGCGGTCACCCGAGACCAATTCTGCATGGTCTTGCATCACTCGGTTTCTCAGCTAGACATGTTCTCATGAAATACGCAGGAAATGATGCTTCAAATGTCAAAGCTCTGAAGGCTAGATTCGCCAAGCCGGTGTTGCCAGGACAGACGTTAATTACAGAAATGTGGTTGGAAGGAAAGCGGGTTCACTTCCAGACTAAGTTGAAGGAGACTGGCAACATTGTTATTGCCAGTTCCTACATGGATTTGAAGAATGTTATCAAAGACGGCGCACCTGCATCCAACCAAAAGATGGCAGCACCATCCGCTTCGTCGTTGAAGAGTGACTCTTTGTTTGCCAAAATTGAGGATGGCATTAAAGCTAACCCGGACAAGGCCAAGAGTGTTGGCGCTGTTTACTTGTACAATATCACTCTCAACGGAAAAACTGTGAAGCAGTGGACTCTGGATTTAAAATCGGCATTGGCAGTATACCAAGGTGAACCTAAAAGTGGTAAAGCAGATACAACAATGACGGTGTCAGACGATGACCTTATGGAAATTGCTGCTGGCACACTCAGCCCTCAAGTTGCTTACCTAAAGGGAAGGTTAAAGATTTCTGGAAACATTATGTTGGCTCAAAAACTCGGACCTTTGCTTAAGAGTGAAGCTAAATTATAA

Protein sequence:

>DPOGS207507-PA
MDQLNFAGRVAVVTGAGGGLGKAYALLLGSRGAKVVVNDLGSARDGVGKSKFADLVVQEIKDKGGIAVADYNSVVEGEKIIKTALDNFGRVDILINNAGILRDKSFQKMSEQDWDLIQAVHLKGAYKTTQAAWETFKKQKYGRIIMTTSNAGLFGNFGQANYSAAKMGLVGLASTLAIEGAKYNIKVNTIVPTAASRLTEDILPPEMFQAMKPDLIAPVVAYMVHESFTDSGLIIDSTLGFATKTHLVRSQGAPLRKKPSDPVTIESVRDNWSQVVNMENAQRIEKIAEVTVDLVEKLQDFEERCKLDGQNESYWGKYKYNSKDLVCYALGVGASVVNPSDLKFLYESHESFSALPTFFILPGMLMESPVVGKAMPPGKYADFTNILHGEQYIEFLSDLPGTEGEFTVRNYVVDLLDKGSSAVAAYMVHESFTDSGLIIDSTLGFATKTHLVRSQGAPLRKKPSDPVTIESVRDNWSQVVNMENAQRIEKIAEVTVDLVEKLQDFEERCKLDGQNESYWGKYKYNSKDLVCYALGVGASVVNPSDLKFLYESHESFSALPTFFILPGMLMESPVVGKAMPPGKYADFTNVREIYQNKELVIRTQQHIFVLGQGGFGGPRNSKQAIAVEAVPKRSPDAVLEQRTAEDQAALYRLSGDLNPLHIDPNVATASGHPRPILHGLASLGFSARHVLMKYAGNDASNVKALKARFAKPVLPGQTLITEMWLEGKRVHFQTKLKETGNIVIASSYMDLKNVIKDGAPASNQKMAAPSASSLKSDSLFAKIEDGIKANPDKAKSVGAVYLYNITLNGKTVKQWTLDLKSALAVYQGEPKSGKADTTMTVSDDDLMEIAAGTLSPQVAYLKGRLKISGNIMLAQKLGPLLKSEAKL-
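
The stated lengths (2664 bp transcript, 887 aa protein) and the two sequences can be cross-checked against each other. The Python sketch below is one way to do that, assuming the standard genetic code applies to this transcript. The `translate` helper is a hypothetical illustration and not part of the geneset's own tooling. It translates the first eight and last six codons copied from the nucleotide record and renders the stop codon as `-`, matching the trailing `-` in the protein record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard code is appropriate for this nuclear transcript.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        # Render stop codons with the same symbol the protein record uses.
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# Length check: 887 residues plus one stop codon should account for 2664 bp.
assert 887 * 3 + 3 == 2664

# First eight and last six codons of DPOGS207507-TA, copied from the record.
print(translate("ATGGATCAATTAAATTTCGCTGGC"))  # start of the protein
print(translate("AGTGAAGCTAAATTATAA"))        # end of the protein, with stop
```

The two printed fragments can be compared by eye with the start (`MDQLNFAG`) and end (`SEAKL-`) of DPOGS207507-PA. Passing the full nucleotide sequence to `translate` would extend the same check to the whole record.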