Monarch geneset OGS2.0

DPOGS216032
Transcript: DPOGS216032-TA; 1623 bp
Protein: DPOGS216032-PA; 540 aa
Genomic position: DPSCF300067 - 588442-592441
RNAseq coverage: 92x (Rank: top 62%)
Annotation
Heliconius: HMEL008945; 0.0; 76.16%
Bombyx: BGIBMGA009014-TA; 0.0; 74.71%
Drosophila: CG11257-PA; 8e-53; 30.69%
EBI UniRef50: UniRef50_D6WYW5; 8e-90; 35.30%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WYW5_TRICA
NCBI RefSeq: XP_002112919.1; 2e-68; 33.09%; hypothetical protein TRIADDRAFT_56539 [Trichoplax adhaerens]
NCBI nr blastp: gi|270011406; 3e-89; 35.30%; hypothetical protein TcasGA2_TC005424 [Tribolium castaneum]
NCBI nr blastx: gi|270011406; 8e-86; 35.30%; hypothetical protein TcasGA2_TC005424 [Tribolium castaneum]
Group
Gene Ontology:
  GO:0020037; 3.5e-26; heme binding
  GO:0055114; 1.2e-20; oxidation-reduction process
  GO:0016491; 1.2e-20; oxidoreductase activity
KEGG pathway: dre:553685; 2e-70
  K00326 (E1.6.2.2) maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain:
  [19-95] IPR001199; 3.5e-26; Cytochrome b5
  [286-403] IPR017938; 1.2e-20; Riboflavin synthase-like beta-barrel
  [301-400] IPR008333; 1.2e-17; Oxidoreductase, FAD-binding domain
  [412-518] IPR001433; 3.2e-16; Oxidoreductase FAD/NAD(P)-binding
  [325-336] IPR001834; 3.8e-12; NADH:cytochrome b5 reductase (CBR)
Orthology group: MCL13854; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216032-TA
ATGGACTGGATCAGGCTGGGAAATTCGGGGAAAGACTTGAACGGAATCGGTGGCCGCATCAGACCCGTTACACCCACGGAACTGGCAACACACAATACACAGGAGGACGCGTGGCTCGCAATTAGGGGACGAGTTTATAATATAACGTATTATTTACCCTACCATCCCGGAGGTCCTGAAGAATTAATGCGTGGCGCTGGTATTGATGCAACGGAACTCTTTGATAAAGTTCACCCATGGGTCAACTACGATTCGCTCTTAGCAAAGTGCTTAGTGGGACCGCTTCGTACCGACAGACCTGATGCTGAGGAACTTTTTGACATCGCAAATGCATCTCCAAAATCTGACCGTCTTAGAGAACCGTCAAAAGCTCAAGAACTCGTTAGAAAATCAATGGAAAATTTAGCAAACTGTATAACACCTGTAAGGAAAAAGATTACCACTAAAAGCGAAGACAACGTTAAAGGAAGTCCTCCAAGCAAAATCATGCAAAGTCTAATACAGTCAAGTGATCTCCCAGTATCAATAAGCCGTAGAGCTGCGTCGAGTCCAATTAAGTCTGACAAAGCACAAGGGTCACCGCCTCCTTTGCGTTTCGACTGGATACAAACATCAATGAAACTTACAATCTCAATATATACTGGCCTTTTAGCTAATCCAGGTGGATGTGCAAAGATAACAGACGGTCGATTACACGTGGAAGTAGCTACTGACGGATGGCTTAGAACGGTGCAATTAATACCAGATGAACAAATCAAAGACCCTTTGCAAATGAGGGTTTTTTCTGAGAGTGGAAAAATTGAGGTAATGGCGTCTAAAGTGGAACACAAGGTTTGGAAGAACGTGGGTGTGATGTCATTCGGTGCGGCCACTCAGATATCGTCTCCGCGGACATTGGATTGTCGGGTGATGCATGTGGAGCGAGTCTCCCATGACACTACTTTACTTGCTTTGTCACCTGTGTCTGGGCCGGTAATTGTACCACTAGGGCACCATATTAGAGTTCACCATAAAATAAACGATAAGGAATGTATACGTTCGTACACCCCCGTGGGAGACGGGTGGGACAATTTTGGCGGTGACCTGAGCGCATTGAAATTGGCGATTAAGAGATATGATACGGGGGATCTATCACCGTATCTCACGTCCTTGAAGTTGGGCGATCTCGTCACGCTTTCAGGACCGTATGGAAATTTTCAATTACAGAAGCTAAAACAAGTAAAAGTAATGTATTTGATAGCCGCAGGCAGTGGGATAACGCCAATGTTGGGCTTATTGAGATTTATGCTGGCGCGATCTAACCTCAAATGCAAACGAGTCCACTTGATTTTCTTCAACAAAACAGAAGAAGACATACTTTTCAAAGAAAAAATCAATGAAATAATAAAACAAGACGACAGATTAAATGTCATTCACGTCTTATCAAACGCCAGCTCGTCTTGGACGGGACTTAAGGGCCGGATTAGCAGCGAGATCCTTTCTCTAGTTATAGATAAAGAGCTTTACACACAGGAGCAACATTTCGCATGCCTATGTGGCAAAACAGAGTTTACGCAAGCCGGTCTAGAGATTTTAAGGAGGTTGGGTGTGAAAGAGAACGATGTACATGCGTTTATTGGATAA

Protein sequence:

>DPOGS216032-PA
MDWIRLGNSGKDLNGIGGRIRPVTPTELATHNTQEDAWLAIRGRVYNITYYLPYHPGGPEELMRGAGIDATELFDKVHPWVNYDSLLAKCLVGPLRTDRPDAEELFDIANASPKSDRLREPSKAQELVRKSMENLANCITPVRKKITTKSEDNVKGSPPSKIMQSLIQSSDLPVSISRRAASSPIKSDKAQGSPPPLRFDWIQTSMKLTISIYTGLLANPGGCAKITDGRLHVEVATDGWLRTVQLIPDEQIKDPLQMRVFSESGKIEVMASKVEHKVWKNVGVMSFGAATQISSPRTLDCRVMHVERVSHDTTLLALSPVSGPVIVPLGHHIRVHHKINDKECIRSYTPVGDGWDNFGGDLSALKLAIKRYDTGDLSPYLTSLKLGDLVTLSGPYGNFQLQKLKQVKVMYLIAAGSGITPMLGLLRFMLARSNLKCKRVHLIFFNKTEEDILFKEKINEIIKQDDRLNVIHVLSNASSSWTGLKGRISSEILSLVIDKELYTQEQHFACLCGKTEFTQAGLEILRRLGVKENDVHAFIG-
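
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code and assuming that the trailing "-" in the protein sequence marks the stop codon. The function name translate and the stop_symbol parameter are illustrative choices. It translates the first 30 and last 12 nucleotides of DPOGS216032-TA and confirms that the stated lengths agree (540 codons plus one stop codon gives 1623 bp).

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G
# (third position varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence into a protein string.

    Stop codons are written as stop_symbol; "-" is used here only because
    the protein record above ends with that character (an assumption about
    what it denotes).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS216032-TA, copied from the record above.
cds_start = "ATGGACTGGATCAGGCTGGGAAATTCGGGG"
cds_end = "TTTATTGGATAA"

print(translate(cds_start))  # MDWIRLGNSG
print(translate(cds_end))    # FIG-

# Length consistency: 540 residues plus one stop codon, three bases each.
print(3 * (540 + 1))         # 1623
```

Passing the full nucleotide sequence to translate should reproduce the complete DPOGS216032-PA string if the two records are consistent; only the two ends are checked here.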