Monarch geneset OGS2.0

DPOGS211391
Transcript: DPOGS211391-TA, 1053 bp
Protein: DPOGS211391-PA, 350 aa
Genomic position: DPSCF300115 - 568759-570850
RNAseq coverage: 3225x (Rank: top 4%)
Annotation
Heliconius: HMEL005705; 5e-101; 58.17%
Bombyx: BGIBMGA012527-TA; 4e-54; 39.81%
Drosophila: CG7675-PB; 1e-50; 36.01%
EBI UniRef50: UniRef50_B3SA21; 2e-57; 40.31%; Putative uncharacterized protein n=3 Tax=Trichoplax adhaerens RepID=B3SA21_TRIAD
NCBI RefSeq: XP_002115353.1; 3e-58; 43.15%; hypothetical protein TRIADDRAFT_50666 [Trichoplax adhaerens]
NCBI nr blastp: gi|196010978; 6e-57; 43.15%; hypothetical protein TRIADDRAFT_50666 [Trichoplax adhaerens]
NCBI nr blastx: gi|196010978; 1e-55; 43.15%; hypothetical protein TRIADDRAFT_50666 [Trichoplax adhaerens]
Group
Gene Ontology:
  GO:0005488; 8.7e-62; binding
  GO:0008152; 1.8e-19; metabolic process
  GO:0016491; 1.8e-19; oxidoreductase activity
KEGG pathway:
InterPro domain:
  [51-339] IPR016040; 8.7e-62; NAD(P)-binding domain
  [60-207] IPR002198; 1.8e-19; Short-chain dehydrogenase/reductase SDR
  [61-78] IPR002347; 6e-19; Glucose/ribitol dehydrogenase
Orthology group: MCL21137 Lepidoptera specific

Nucleotide sequence:

>DPOGS211391-TA
ATGGAGTATCTCTTTCATGAGGAAAGTGGGCAGGAACTCCTTCACTGGATGGTACGGCATCAGGCATTCACTGCAAACGCTTCAAGCAGGAATTTATTCAGCACAACAGTAACGCAAGCAAAGTTGCATACGAAGAACACGAATTCGATATGTCGTGCAAGGACGAGGTTGGATGGAAAAACTGCTATCATAACCGGCGGGACATTGGGCATGGGTCTGAATGTAGCCACCGACTTCGCCGATCGGGGAGCGCGCGTCATCATCGCATGCCCCTTCGAACCTGAAGGAAGAAACGCCCAGAGGCTCATCGAAGAAGAGACGGGAAATAAACAAATTATATTTAAATTTCTCGACCTGTCGTCGTTTAAGTCGGTGCGAGATTTTGCCTCTGACATACTTAATAGGGAAGATAGACTGGACATTTTAGTGAACAATGCAGGCGTCGGCTACCTAGAAGACGATGTGACCAAAGATGGACTAAATGTGATTTTGCAAATTAATTACTACGGACATTTTCTTTTAACTTTGTTACTGCTGCCCCTTCTCAAACGAACCGGCACTAAGTCGGAACCCGCCAGAGTAGTGAACGTGTCGTCGCTACTACACTACTTGGGTACTATGAATGCATGTTACATAAAACGCTCCAAGGCGTTGCACGCGCTGCAACTGTATTCAGACAGCAAATTTTTCTTAATGGTATTTACACGCGAGCTATCAAAGAAATTGAGTGAATCTAACGTTGTCGTGAATTGCGTGGATCCGGGCGCTGTTGGGACGGAGATATTCTACAGTATAGGATGTGTGTGGGGGCCTCTCATCAAATGCTTCTTCTCCACGATCTTCAAGACACCCTGGGAAGGAGCACAGACTACGATTCACGTGGCTCTGGATAAGAAGGCTGGCACTATAAGTGGTCAAATGTTCAAAAACTGTCAAGTGTCACATGCAAAACAATCGGCCTTTAGTGAAATAAACCGTAAACGGTTGTGGGACGACTCTATTAAGCTCGTCAAACTCACTGAAGCGGAACATCAACAGTGTCTGTACACATGA

Protein sequence:

>DPOGS211391-PA
MEYLFHEESGQELLHWMVRHQAFTANASSRNLFSTTVTQAKLHTKNTNSICRARTRLDGKTAIITGGTLGMGLNVATDFADRGARVIIACPFEPEGRNAQRLIEEETGNKQIIFKFLDLSSFKSVRDFASDILNREDRLDILVNNAGVGYLEDDVTKDGLNVILQINYYGHFLLTLLLLPLLKRTGTKSEPARVVNVSSLLHYLGTMNACYIKRSKALHALQLYSDSKFFLMVFTRELSKKLSESNVVVNCVDPGAVGTEIFYSIGCVWGPLIKCFFSTIFKTPWEGAQTTIHVALDKKAGTISGQMFKNCQVSHAKQSAFSEINRKRLWDDSIKLVKLTEAEHQQCLYT-
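
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that translates a coding sequence and checks that the reported lengths agree. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein record marks the stop codon; the function names translate and lengths_consistent are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol; "-" is an assumption chosen
    to match the way the protein record above ends.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def lengths_consistent(transcript_bp, protein_aa):
    """Check that a CDS length equals three bases per residue plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First seven and last four codons of DPOGS211391-TA, copied from the record.
print(translate("ATGGAGTATCTCTTTCATGAG"))  # MEYLFHE
print(translate("CTGTACACATGA"))           # LYT-
print(lengths_consistent(1053, 350))       # True
```

Passing the full nucleotide sequence of DPOGS211391-TA to translate should, under the same assumptions, reproduce the DPOGS211391-PA record; the short fragments here only confirm that the two ends match.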