Monarch geneset OGS2.0

DPOGS211657
TranscriptDPOGS211657-TA906 bp
ProteinDPOGS211657-PA301 aa
Genomic positionDPSCF300325 + 257707-259826
RNAseq coverage307x (Rank: top 37%)
Annotation
HeliconiusHMEL0063032e-15082.55% 
BombyxBGIBMGA011707-TA7e-12475.19% 
DrosophilaCG8768-PA3e-7345.39% 
EBI UniRef50UniRef50_Q1HPI34e-12472.67%C14orf124 protein n=5 Tax=Arthropoda RepID=Q1HPI3_BOMMO
NCBI RefSeqNP_001040495.17e-12572.67%epimerase family protein SDR39U1 [Bombyx mori]
NCBI nr blastpgi|1140528371e-12372.67%epimerase family protein SDR39U1 [Bombyx mori]
NCBI nr blastxgi|1140528377e-12772.67%epimerase family protein SDR39U1 [Bombyx mori]
Group
Gene OntologyGO:00054885.2e-15binding
GO:00442373.7e-09cellular metabolic process
GO:00038243.7e-09catalytic activity
GO:00506623.7e-09coenzyme binding
KEGG pathway 
InterPro domain[4-298] IPR0100991.1e-124Sugar nucleotide epimerase YfcH, putative
[2-239] IPR0160405.2e-15NAD(P)-binding domain
[247-293] IPR0135496.3e-14Domain of unknown function DUF1731, C-terminal
[6-218] IPR0015093.7e-09NAD-dependent epimerase/dehydratase
Orthology groupMCL14054 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211657-TA
ATGGCGGCTAAAAAAGTTTTGATTGGTGGCGGCACGGGCTTTGTCGGAACTCGCTTAAATAGTCTCTTAAAAACTAACAACTATTGTGTGGTGAACATATCTCGTATGCCGGGTGCCAACAACATTTCCTGGTCTGCTTTGGAGACTTCAGGATTGCCTTCACAGACATGTGCTGTAGTAAATCTGGCAGGTCAACAATTCATGGATTTTACTAAAATGTGGACACCAGGTTTTAAACAAAATGTAAAAACATCCAGAGTATATACAACAAAGGCAATTGCAAGTGCTATTAACAAAAGCCAAGATAAACCTAGAGTATTTGTGTTGATCACCGGAGTAGGTGCTTATGAGCCATCAGAGACTATAAAGTATGATGAATCGAGTCCTACAACAGGCAATGACTTTTTCTCTAAATTGCTAGTTGAATGGGAACAAGCTGCAAAAGTTGACCCACCCGTAAGGCTGGTTATAATACGTTCAGGTGTTGTACTTGGTCGTGAAGGTGGTATGATAAAGAACATGATAGTACCATTTTTCTTCGGACTTGGTGGTCCCATAGCATCCGGGAAACAGTTCCTGCCATGGATCCACATAGAAGATCTCATTAGACTGATACAATTTGCCATAGAAAATGAAAATGTCAAAGGAGTCTTAAATGGTGTGGCTCCTCAGGTTATTACAAATGCAGAATTTACTAAGTCGTTCGCTAAAGCCTTGTCACGTCCTGCGTTCTTCACTGTACCAGAATTTTCGTTAAACCTCCTATTGAATCCAGAAAGGGCTATGATGTTGACCAAAGGTCAATATGTCGTGCCAAAACGAACTTTAGAATATGGCTTCCAGTACAAATATTGCACAATAGACGAAGCTTGCGCTGAATGTGCACATTTATTCTCAAAAAATTAA

Protein sequence:

>DPOGS211657-PA
MAAKKVLIGGGTGFVGTRLNSLLKTNNYCVVNISRMPGANNISWSALETSGLPSQTCAVVNLAGQQFMDFTKMWTPGFKQNVKTSRVYTTKAIASAINKSQDKPRVFVLITGVGAYEPSETIKYDESSPTTGNDFFSKLLVEWEQAAKVDPPVRLVIIRSGVVLGREGGMIKNMIVPFFFGLGGPIASGKQFLPWIHIEDLIRLIQFAIENENVKGVLNGVAPQVITNAEFTKSFAKALSRPAFFTVPEFSLNLLLNPERAMMLTKGQYVVPKRTLEYGFQYKYCTIDEACAECAHLFSKN-