Monarch geneset OGS2.0

DPOGS203159
TranscriptDPOGS203159-TA1173 bp
ProteinDPOGS203159-PA390 aa
Genomic positionDPSCF300035 - 822515-842799
RNAseq coverage898x (Rank: top 14%)
Annotation
HeliconiusHMEL0109667e-16481.79% 
BombyxBGIBMGA007047-TA5e-1828.81% 
DrosophilaCG8888-PA1e-11553.46% 
EBI UniRef50UniRef50_Q7K3N41e-11353.46%CG8888 n=10 Tax=Diptera RepID=Q7K3N4_DROME
NCBI RefSeqXP_967401.11e-13660.76%PREDICTED: similar to GA21392-PA [Tribolium castaneum]
NCBI nr blastpgi|910774522e-13560.76%PREDICTED: similar to GA21392-PA [Tribolium castaneum]
NCBI nr blastxgi|910774522e-13060.76%PREDICTED: similar to GA21392-PA [Tribolium castaneum]
Group
Gene OntologyGO:00054889.5e-38binding
GO:00081521.5e-16metabolic process
GO:00164911.5e-16oxidoreductase activity
KEGG pathwaycin:1001763967e-48 
 K00019 (E1.1.1.30, bdh)maps-> Butanoate metabolism
    Synthesis and degradation of ketone bodies
InterPro domain[102-299] IPR0160409.5e-38NAD(P)-binding domain
[104-273] IPR0021981.5e-16Short-chain dehydrogenase/reductase SDR
[105-122] IPR0023477.4e-07Glucose/ribitol dehydrogenase
Orthology groupMCL15660 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203159-TA
ATGGCCGCCGGAGACGTGACGCGGCGAGCATCCATCACAGCACCCTCGATGCATCGCCGCCCTTCGGCTAGGAGAGGTTCACTGATCAAGAGCTCCCAACCCTCGTCATCTCAGGAGGTGCCATGGGACATAATAGACCGGTGCGCTCTACCGGTAGTGCTTTGCCACGCTTTAGCGGTAGTTCTTTCAGCACTGTTAAACGCTTTACATCTCAGCCAAATATCAGTCTTCACTTTGTTCCTCTGGTTTGCCATCTCAGTAACCGGTTCCCTCTGGTTCTACCATAATCTTCAGGTAACAGCAGCAGGGAAAGCGGTTTTGGTGACAGGTTGTGACAATGTGTTGGGAAATGCTCTGGCTAGAAGATTGGATGACTTGGGCTATCATGTGTTCGCGGGTTTTCAAAACAAGGCAGGCAACATTGATGCCGACATGCTCAAAGAAGACTGTTCCGGAAGGTTGCACACCTTGCAACTTGACATCACATCAGAAACACAGATTCTTTCAGCGTCTCTGTACATAGTTGATCACCTGCCAGAGGGCGCTCAAGGTCTTTGGGCAATCGTGAACTGCGAATCCTGGTGTGCACTGGGCGAACTAGAATGGGTGCCGTTTTCCGTAATACGACGCGCCATGGAAGTTAATCTGTTGGGACCAGCTCGTTTAGTTCAAGTGATGCTGCCGTTGGTGCGCCGTGCTCGTGGACGTGTTGTTCTGGCATCTTCGATCCTAACTCACGTGGCTGCTCCAGTACGAGGTGTTCATGCAGCTTCACTAGCTGCCCTGGACGCGCTTGCTGCCTGTCTGCGGCGAGAACTTAAGCCCAGGGGTGTTGATGTTGTCGTTGTCGCTGCGGGTGAATACACTACAGGTAGTGCTTGGCTCTCCGAGGAGAAACTTCTAGAGCAAGCTAGGGATATGTGGAAAAGACTCAGCGACGAACAAAAGGGCGCCTACGGAGAGGATTACTTCGAACAAGCGTTGAGGAGCCTTGAGAAATACACAAAAAGCCCTGACGCTGACCTAACGGCTGTGACGCGGGCGCTTAGTGATGGCGTCACCCGCACCTTCCCCCTGGCTCGTTACACCCCGGTCTCACCGCGAGAGAAACTGAAGTCCCTACTAGCTGAGCACATGCCCCGATCTCTTTACGAAGGACTCTACGCCGACTAG

Protein sequence:

>DPOGS203159-PA
MAAGDVTRRASITAPSMHRRPSARRGSLIKSSQPSSSQEVPWDIIDRCALPVVLCHALAVVLSALLNALHLSQISVFTLFLWFAISVTGSLWFYHNLQVTAAGKAVLVTGCDNVLGNALARRLDDLGYHVFAGFQNKAGNIDADMLKEDCSGRLHTLQLDITSETQILSASLYIVDHLPEGAQGLWAIVNCESWCALGELEWVPFSVIRRAMEVNLLGPARLVQVMLPLVRRARGRVVLASSILTHVAAPVRGVHAASLAALDALAACLRRELKPRGVDVVVVAAGEYTTGSAWLSEEKLLEQARDMWKRLSDEQKGAYGEDYFEQALRSLEKYTKSPDADLTAVTRALSDGVTRTFPLARYTPVSPREKLKSLLAEHMPRSLYEGLYAD-