Monarch geneset OGS2.0

DPOGS208763
TranscriptDPOGS208763-TA819 bp
ProteinDPOGS208763-PA272 aa
Genomic positionDPSCF301125 + 2815-4431
RNAseq coverage33x (Rank: top 75%)
Annotation
HeliconiusHMEL0174182e-8653.18% 
BombyxBGIBMGA012400-TA3e-8552.03% 
DrosophilaSodh-2-PA8e-7952.77% 
EBI UniRef50UniRef50_Q029123e-10464.04%Sorbitol dehydrogenase n=1 Tax=Bombyx mori RepID=DHSO_BOMMO
NCBI RefSeqNP_001037311.16e-10564.04%sorbitol dehydrogenase [Bombyx mori]
NCBI nr blastpgi|1129837441e-10364.04%sorbitol dehydrogenase [Bombyx mori]
NCBI nr blastxgi|1129837444e-10364.04%sorbitol dehydrogenase [Bombyx mori]
Group
Gene OntologyGO:00082701.9e-83zinc ion binding
GO:00551141.9e-83oxidation-reduction process
GO:00164911.9e-83oxidoreductase activity
GO:00054885.6e-34binding
KEGG pathwayaag:AaeL_AAEL0111123e-79 
 K00008 (E1.1.1.14, gutB)maps-> Fructose and mannose metabolism
InterPro domain[1-262] IPR0020851.9e-83Alcohol dehydrogenase superfamily, zinc-type
[1-145] IPR0110322e-43GroES-like
[138-260] IPR0160405.6e-34NAD(P)-binding domain
[2-104] IPR0131544.8e-25Alcohol dehydrogenase GroES-like
[145-262] IPR0131491.3e-19Alcohol dehydrogenase, C-terminal
Orthology groupMCL27825 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208763-TA
ATGGATTGTGTCGGAATATGTGGTTCTGACCTCAAGCTGTACTCCATGGGGCGCTGTGGATCCGAGAAGTTATCCAAACCCATTGTTATGGGCCACGAGGGAGCTGGCGTCGTAGTTCAGGTAGGGAGTCAGGTGACCACTTTGGTAGCTGGTGACCGCGTGGCTATTGAGCCGACCCAGCCTTGCCGTAGCTGCACGTTCTGCCGCAGCGGCAGGTACAACGTGTGCGAACAACCACGATACTGTTCCACTGACGGAGCGGACGGAAACCTCTGCACCTACTACAAACATGTAGCTGACTTCTGTCACAAGATACCGGACAATGTGACAATGGAGGAAGGTGCAGCCACCCAGCCACTAGCGATCGCGGTGCATGCGTGCTCACGCGCAGGAATACAACTAGGCAGTACACTGCTGATTATGGGCGCAGGCCCTGTGGGCCTATTATGCGCCATCACCGCACGGGCCATGGGCGTCGCTAAGATACTTATGACCGACATGGTCGCCTCCCGGATCGAGATTGCTAAGAGGTTGGTTGCAGATCATACCTTGCTAATTAAATCGGAATACAACGAAGAGGACATCGTGAAGCGTGTGACAGAGACGCTGGGCGGCCCGCCCGACGTCACGATAGACGCGTGCGGACACGAAACAGCGCAACGTGTGGCGTTGATGGTAACAAAGACTGGCGGAGTTGTGCTGGTGGTAGGAATCGGTGAGGGGCTGGTGTCTGTGCCGCTGAGTTCTGCTCTTCTACGAGAGGTCGACATTCGAGGATCCTACAGGCTGCTTAACTCGTATGACACTTGGATATATTGA

Protein sequence:

>DPOGS208763-PA
MDCVGICGSDLKLYSMGRCGSEKLSKPIVMGHEGAGVVVQVGSQVTTLVAGDRVAIEPTQPCRSCTFCRSGRYNVCEQPRYCSTDGADGNLCTYYKHVADFCHKIPDNVTMEEGAATQPLAIAVHACSRAGIQLGSTLLIMGAGPVGLLCAITARAMGVAKILMTDMVASRIEIAKRLVADHTLLIKSEYNEEDIVKRVTETLGGPPDVTIDACGHETAQRVALMVTKTGGVVLVVGIGEGLVSVPLSSALLREVDIRGSYRLLNSYDTWIY-