Monarch geneset OGS2.0

DPOGS209858
TranscriptDPOGS209858-TA1344 bp
ProteinDPOGS209858-PA447 aa
Genomic positionDPSCF300451 + 64114-73448
RNAseq coverage140x (Rank: top 55%)
Annotation
HeliconiusHMEL0169731e-17491.24% 
BombyxBGIBMGA002346-TA0.081.34% 
DrosophilaCtBP-PE0.083.62% 
EBI UniRef50UniRef50_O460360.083.62%C-terminal-binding protein n=88 Tax=Bilateria RepID=CTBP_DROME
NCBI RefSeqXP_972241.10.080.69%PREDICTED: similar to 2-hydroxyacid dehydrogenase [Tribolium castaneum]
NCBI nr blastpgi|910903120.080.69%PREDICTED: similar to 2-hydroxyacid dehydrogenase [Tribolium castaneum]
NCBI nr blastxgi|910903120.081.34%PREDICTED: similar to 2-hydroxyacid dehydrogenase [Tribolium castaneum]
Group
Gene OntologyGO:00054882.7e-64binding
GO:00166162e-55oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
GO:00480372e-55cofactor binding
GO:00551142e-55oxidation-reduction process
GO:00081521.3e-29metabolic process
GO:00512871.3e-29NAD binding
KEGG pathwaytca:6609540.0 
 K04496 (CTBP)maps-> Pathways in cancer
    Wnt signaling pathway
    Chronic myeloid leukemia
    Notch signaling pathway
InterPro domain[142-318] IPR0160402.7e-64NAD(P)-binding domain
[134-318] IPR0061402e-55D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding
[32-352] IPR0061391.3e-29D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain
Orthology groupMCL11354 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209858-TA
ATGGACAAACGCAAGATGCTGCCAAAGAGAGCGCGCATGGATAGCATGCGGGGTCCCATCGCTAACGGACCACTGCAGTCGAGGCCCCTGGTGGCGTTGCTCGACGGTAGAGACTGCACCGTCGAGATGCCCATACTGAAGGACGTGGCTACCGTCGCCTTCTGCGACGCACAGTCCACATCCGAAATACACGAGAAGGTGCTGAACGAGGCAGTCGGCGCTCTCATGTGGCACACCATCATACTCACCAAGGAGGACCTGGAGAAGTTCAAGGCGCTCAGGATCATTGTGCGCATCGGCTCCGGCGTAGACAACATCGACGTCAAGGCCGCCGGCGAGTTGGGCATAGCGGTTTGTAACGTGCCGGGTTACGGCGTTGAGGAGGTGGCCGACACCACCATGTGTCTCATACTGAACCTCTACAGACGGACGTACTGGCTGGCCAACATGGTGCGCGAGGGGAAGAAGTTCACAGGTCCGGAGCAGGTCCGCGAGGCGGCGGCCGGCTGTGCCCGTATCCGCGGCGACACGCTGGGTATCGTGGGTCTGGGCCGGATCGGCTCGGCCGTGGCGCTCCGGGCGAAGGCCTTCGGCTTCAACGTCATCTTCTACGACCCCTACCTGCCCGACGGCATCGAGAAGTCGCTGGGGCTCACCAGGGTCTACACGCTGCAGGATCTACTATTCCAGAGTGACTGTGTGTCATTGCACTGCAGCTTAAACGAACACAATCACCATCTTATTAATGAATTTACTATCAAACAAATGCGTCCAGGGGCGTTCCTGGTGAACACGGCCCGCGGCGGGCTGGTCGACGACGAGGGTCTAGCGGCGGCTCTCAAACAGGGACGGATCCGGGCGGCGGCGCTAGACGTGCACGAGAACGAACCCTTCAACGTCTTCCAGGGTCCGCTGAAGGAGGCTCCCAACGTCCTGTGCACGCCTCACGCCGCCTTCTACTCGGACGCCTCCGCCCAGGAACTGAGAGAAATGGCCGCCTCCGAGATACGACGAGCTATCGTCGGACGTATACCTGACTGTCTCAGGAACTGTGTCAATAAGGACTACTTCCTAGCGGGCGCCGCGCCGGTGCTGGCGCCGCCTCCGCCCATCGCAGCGCCTCAACCCCCAGCGCCCGCCTACACTGAAGCGGGTATGAACGGCGGCTACTACGGCGGCGGGGGCGCTCAGGCCGCTCACTCCACGACGGCGGTCCACGAGGCGCCCGCGCTGCCGCCCCAGACGGCGCCGCAACCTCCGCCTCAACCGCCCATCACGCTCCCGATAAACACGTCGGACCCGGCCAATCATCAGCTGAAGCAGGAGAGCTCGGACGTTCACTAA

Protein sequence:

>DPOGS209858-PA
MDKRKMLPKRARMDSMRGPIANGPLQSRPLVALLDGRDCTVEMPILKDVATVAFCDAQSTSEIHEKVLNEAVGALMWHTIILTKEDLEKFKALRIIVRIGSGVDNIDVKAAGELGIAVCNVPGYGVEEVADTTMCLILNLYRRTYWLANMVREGKKFTGPEQVREAAAGCARIRGDTLGIVGLGRIGSAVALRAKAFGFNVIFYDPYLPDGIEKSLGLTRVYTLQDLLFQSDCVSLHCSLNEHNHHLINEFTIKQMRPGAFLVNTARGGLVDDEGLAAALKQGRIRAAALDVHENEPFNVFQGPLKEAPNVLCTPHAAFYSDASAQELREMAASEIRRAIVGRIPDCLRNCVNKDYFLAGAAPVLAPPPPIAAPQPPAPAYTEAGMNGGYYGGGGAQAAHSTTAVHEAPALPPQTAPQPPPQPPITLPINTSDPANHQLKQESSDVH-