Monarch geneset OGS2.0

DPOGS202387
TranscriptDPOGS202387-TA1332 bp
ProteinDPOGS202387-PA443 aa
Genomic positionDPSCF300104 + 586795-597260
RNAseq coverage989x (Rank: top 13%)
Annotation
HeliconiusHMEL0155727e-2046.94% 
BombyxBGIBMGA014478-TA1e-3038.57% 
DrosophilaCG2604-PA2e-7937.81% 
EBI UniRef50UniRef50_E0VTR13e-8740.63%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VTR1_PEDHC
NCBI RefSeqXP_001656085.12e-9039.31%hypothetical protein AaeL_AAEL002861 [Aedes aegypti]
NCBI nr blastpgi|1571325904e-8939.31%hypothetical protein AaeL_AAEL002861 [Aedes aegypti]
NCBI nr blastxgi|1571325909e-8539.40%hypothetical protein AaeL_AAEL002861 [Aedes aegypti]
Group
Gene OntologyGO:00551147.1e-29oxidation-reduction process
GO:00164917.1e-29oxidoreductase activity
GO:00054882e-18binding
KEGG pathwaystp:Strop_33962e-25 
 K00292 (E1.5.1.9)maps-> Lysine degradation
InterPro domain[26-422] IPR0050977.1e-29Saccharopine dehydrogenase / Homospermidine synthase
[26-205] IPR0160402e-18NAD(P)-binding domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202387-TA
ATGGCACTGGCCATACGTACGTTGTTACCTGGGTTGGGTTTGTTACAGTTGTGTAAAATGAATAGATTCGACCTGGTAATATTCGGGGCGACCGGTTTCACGGGGAAGCATGTAGTCAGAGAGCTCGTCAGGATCGCCCCTAAACATCCGGGGTTGACTTGGGCGGTGGCTGGGAGGAGCCGGGGGAAAATGGAAACGCTTCTACAGAATGTCACCAAAAAAACTGGGGTTGAGCTATCTAACATTAAAATAATAACAGCCGACGTGAATGACACGGAATCTCTGAAACGCATGTGCCGTCAGGCTAGAGTACTAGTGAACTGTTGTGGTCCGTTCCTGAAATACGGCGAGCCTGTGGTGGCCGCGGCGATCGAGACGAAGACACATCACGTAGATGTCAGCGCTGAGATAAAGGTGAGGGCGAGATCGGTGAGGGAGTTCACTGAGATGTTGCATCAGAGGTACGACGCGTCAGCTCGCGAGGCCGGGGTCTGCGTCGTGTACGCCTGCGGCTTGTGCAGTCTGCCCGCGGATGTGGGACTCCTGTACCTCCAGCGAGAATTCCAAGGCGTGTTGAACTCGGTGGAATCATATCTCGTGACTCACTTCCCGCCGAGGATGTTGACTGAGACCTGGAAAAATGGAATCGTTCGACACAGCAGCTGGGAGTCGTTTATAAACGGCATGGCCGGCATGTCGATAAAACAGTTGCAGCAAAACTCACTGCCGCCGATGGAGCGCGAACTGAAAAGCAGATGTTTAATACATAGGAATCTCAACAAATGGTGCGTTCCGTTTCCTGGTGCAGACGCGGCCGTCATCTCTCGCACGCAGCGCCATCTCTTTTCGACGACCGACAAACGACCCGTACAATACAAACCGTACATAACTTTTCCCTCTATTTTTACAGCCATAGGAACTATTATCGCCTGTGTTATTTTGTTCGCCCTTAGTAAAATGTCGTGTAGTAGAAAATTACTGCTGAACTATCCACGACTGTGTTCGTTCGGTGTGGTCACGTATGGCGACACTGAAGAGGGCGTAATGGATGACGCATACTTTCAATATGAAATGATCGGTAACGGTTGGAGTACAGGTGCGGATCGTAGTGGAGCGCCTGACAAAAATGTGATAGCAAGGATCAAGGTTTCAGGAGTGGATCCGGCTTATGTTGGCTCTGCGATTGTCCTTATTTATTCTGCAATAACTTTGTTGAAGGAAAAAGACAGAATCCCCGACTGTGGCGTGATGACTCCCGGAGTAGCCTTCCGAAACACTAATATCATAGAACATCTTAAAGCAGACAATGTAAAATTTGAAATTGTTAAATAA

Protein sequence:

>DPOGS202387-PA
MALAIRTLLPGLGLLQLCKMNRFDLVIFGATGFTGKHVVRELVRIAPKHPGLTWAVAGRSRGKMETLLQNVTKKTGVELSNIKIITADVNDTESLKRMCRQARVLVNCCGPFLKYGEPVVAAAIETKTHHVDVSAEIKVRARSVREFTEMLHQRYDASAREAGVCVVYACGLCSLPADVGLLYLQREFQGVLNSVESYLVTHFPPRMLTETWKNGIVRHSSWESFINGMAGMSIKQLQQNSLPPMERELKSRCLIHRNLNKWCVPFPGADAAVISRTQRHLFSTTDKRPVQYKPYITFPSIFTAIGTIIACVILFALSKMSCSRKLLLNYPRLCSFGVVTYGDTEEGVMDDAYFQYEMIGNGWSTGADRSGAPDKNVIARIKVSGVDPAYVGSAIVLIYSAITLLKEKDRIPDCGVMTPGVAFRNTNIIEHLKADNVKFEIVK-