Monarch geneset OGS2.0

DPOGS215414
TranscriptDPOGS215414-TA1683 bp
ProteinDPOGS215414-PA560 aa
Genomic positionDPSCF300088 + 644216-646885
RNAseq coverage10x (Rank: top 84%)
Annotation
HeliconiusHMEL0174350.066.07% 
BombyxBGIBMGA012375-TA1e-8161.67% 
DrosophilaCG7675-PB3e-5037.29% 
EBI UniRef50UniRef50_G6D2795e-5643.12%Putative RDH13 n=2 Tax=Eumetazoa RepID=G6D279_DANPL
NCBI RefSeqXP_798545.22e-5438.00%PREDICTED: similar to RDH13 [Strongylocentrotus purpuratus]
NCBI nr blastpgi|2608368053e-5339.86%hypothetical protein BRAFLDRAFT_68404 [Branchiostoma floridae]
NCBI nr blastxgi|2608368058e-5539.37%hypothetical protein BRAFLDRAFT_68404 [Branchiostoma floridae]
Group
Gene OntologyGO:00054886e-58binding
GO:00081525.6e-21metabolic process
GO:00164915.6e-21oxidoreductase activity
KEGG pathway 
InterPro domain[264-550] IPR0160406e-58NAD(P)-binding domain
[276-417] IPR0021985.6e-21Short-chain dehydrogenase/reductase SDR
[276-293] IPR0023471.9e-11Glucose/ribitol dehydrogenase
Orthology groupMCL25957 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215414-TA
ATGCGGGCTAGAGACAAAATAGAAAAGATATCAGGGAACAGAAATGTGTCTTACAAGCTTTTGGACTTAGCTTCTTTGACATCAGTGAGGAGTTTTGTAAGCGACACCATCAACACTGAAGAAAGGTTAGACATTTTAGTCAACAATGCTGGGGCTGTGGGACTCCCTGACAGATTGACGGAAGATGGCCTTAATCTTACCATGCAAGTAAATTTTTTTGGTACATTCCTACTGACGTATCTATTGCTGCCTCTGTTAAAAAGTTCAGCACCGAGTCGAATAATAAACAGTTCAGCGTCGTCGATGTACGTCGGACATATCGACTTCGAACAGTGGAATGATATTGAACGATACACTCCCATAGAGGTACTGGCCAACTCAAAACTAGCAGTGACTCTGTTTAGCGCAGAATTAAGTGCAAGGCTAAAAGAATACGGGGTGTCTGTGAACACTTACGATCCCTTTGTTGTTAAAGATACTGATATATTGGATAATCTTCCAAGCTTATTAAAAAATATAACCCAGCTCTTTGTGGATATTGTTGGACAACCCAAGGAAGACGTTGGAAGAGAAATCGCTTACTTAGCATCCGATCCACAGTTTGAAAAAGTCAGTGGCGAGCATTATAAATTCTGTAAAAAGTGGATAAATCATTGGTTAGTCGATGACAGGCATCTGACGAGAGAATTGTGGGAAGTTTCAAAAAAAAATGTAAATATATCTACTATCAGTGTTATTTTTGCTGTCTGTGAACACGGTCCGGTTGTGATGCGGGTCCAGGACGACTTCGCAAACTGTGAAAACAATGTTCGTATGGATGGACAAGTGGCCATCGTTACTGGAGCTACTGCCGGTATAGGGTTTGAAGTTGCCAAAAATTTTGCGAAAAGAGGAGCCAGAGTCATAATAGGCAGCAGGAACCCTGCTAAAATGGATAAAGCAAAAAACGCAATCATACAAAGCTCTGGAAATACGAACATTTCAACCAGAAAACTCGACTTCGCCTCTTTAAAATCTATACGAAGGTTCGCAAGTGCAATATATATGAGTGAACCCAAATTAAACATACTAATCAACAACATCGGCGCCCTCGGCTTACCAGATCGACTGACGAAGGACAAACTTAATCTGATGATGCAAGTTAACTACTTTGGGGCCTTCCTGCTAACCTTCTTGCTATTGCCTTTATTAAGGACTTCAGCCCCGAGTAGGATCGTGAATGTGTCGTCTATAACTCTACTTCTGGGTCACATTGAACTCGATCATATGAACGATGTTGGAAGATTTTCAAGCTTCGGAATGTATTGCAACTCAAAGTTAGCAGATATATTGTTTACTGTTGAAATGAACAAACGGATACGAGGCAGTGGCGTTAACGTGTACAGTATGGACCCCGGGCTGAGTAAGTCTGAGTTCTTCAGAGATTTTAACGACACAACCTTAAGAAATGTTTTTAATGCAGGCATGTTGTTATTGGGACGCGATTTAGATAGGGTTGCTACAATGCCAGTGTTTTTAGCGACAGATCCGAGGGTCCAAAATTCGAGCGGTAAGCATTTCAGAGACTGCGCTGAATTCTACAGTTCGTGGTTTGCTGAAGACGCTGACTTAACTCGGAAACTGTGGGAGGAATCAAAACGATTAGTAAACATCACCACTCAAGAGGACTGGGAACTTAAATAG

Protein sequence:

>DPOGS215414-PA
MRARDKIEKISGNRNVSYKLLDLASLTSVRSFVSDTINTEERLDILVNNAGAVGLPDRLTEDGLNLTMQVNFFGTFLLTYLLLPLLKSSAPSRIINSSASSMYVGHIDFEQWNDIERYTPIEVLANSKLAVTLFSAELSARLKEYGVSVNTYDPFVVKDTDILDNLPSLLKNITQLFVDIVGQPKEDVGREIAYLASDPQFEKVSGEHYKFCKKWINHWLVDDRHLTRELWEVSKKNVNISTISVIFAVCEHGPVVMRVQDDFANCENNVRMDGQVAIVTGATAGIGFEVAKNFAKRGARVIIGSRNPAKMDKAKNAIIQSSGNTNISTRKLDFASLKSIRRFASAIYMSEPKLNILINNIGALGLPDRLTKDKLNLMMQVNYFGAFLLTFLLLPLLRTSAPSRIVNVSSITLLLGHIELDHMNDVGRFSSFGMYCNSKLADILFTVEMNKRIRGSGVNVYSMDPGLSKSEFFRDFNDTTLRNVFNAGMLLLGRDLDRVATMPVFLATDPRVQNSSGKHFRDCAEFYSSWFAEDADLTRKLWEESKRLVNITTQEDWELK-