Monarch geneset OGS2.0

DPOGS202386
Transcript: DPOGS202386-TA, 1242 bp
Protein: DPOGS202386-PA, 413 aa
Genomic position: DPSCF300104 + 568200-580913
RNAseq coverage: 4872x (Rank: top 3%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL015572        2e-30    67.78%
Bombyx          BGIBMGA014478-TA  3e-51    51.13%
Drosophila      CG2604-PA         9e-109   47.09%
EBI UniRef50    UniRef50_E0VTR1   4e-113   49.06%    Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VTR1_PEDHC
NCBI RefSeq     XP_312273.4       1e-122   50.12%    AGAP002652-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|158290701      3e-121   50.12%    AGAP002652-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|158290701      5e-117   50.47%    AGAP002652-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology:
  GO:0055114  2.1e-27  oxidation-reduction process
  GO:0016491  2.1e-27  oxidoreductase activity
  GO:0005488  1.7e-17  binding
KEGG pathway:
  prw:PsycPRwf_2123  4e-36
  K00292 (E1.5.1.9)  maps-> Lysine degradation
InterPro domain:
  [7-282]  IPR005097  2.1e-27  Saccharopine dehydrogenase / Homospermidine synthase
  [6-181]  IPR016040  1.7e-17  NAD(P)-binding domain
Orthology group: MCL10525 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202386-TA
ATGTCTCGTTTGGATCTCGTGATATTTGGCGCGACGGGCTTCACGGGGAAGCATGCCGCAATGGAGATGTGCCACATCGTCAAGAAGTATCCTGGCATGAGTTGGGGAGTGGCGGGTCGCTCAGAGGGCAAACTCAACAACTTGCTGAAAGAAGTATCCAAGAAAGTCGACGAGGACTTGTCGTCGGTGAAGGTGATAACAGCGGAGTTGTCGGACGAGGCCTCTCTCAAGGCGATGACGGCCCAGGCCCGCGTGCTGGTCAACTGTTGCGGGCCTTACTACCTGTACGGGGAGCCTGTCGTCAAGGCCTCCATAGACACCAAGACACACTACGTGGACGTCAGCGGGGAGCCGCAGTTCATGGAGAGAATGCAGCTGGTGTACGGGTCGGCGGCGCGCGAGGCCGGCGTCTTCATTATCAGCGCCTGCGGCTTCGACAGCATCCCCAACGACCTCGGGGTCATCTTCCTGCAGCAAAATTTCGGAGGTACCTTGAACAGCGTGGAGTCCTACCTGTCAGGAGAGGTGCCTCCCGAGCACAGCGGGGGCGGAGTCGTCAACTACGGGACCTGGGAGTCGCTTGTACACGGGATGTCGCACCACAACGAGCTGCCGGCGCTCAGGAAGAAACTGTACCCTGAGAAACTACCCACGTACAGACCCAAACTAAAGCCCAGATTCATGATCCATCGGCGCGGCGGCTGGTGCCTGCCGTTCCCGGGCTCCGACTCGTCGGTGGTGTTCCGTACACAGAGACAGCTCCATGCCGAGGGTTCGCGCCCGGCCCAGGTCCGCACGTACGTGAGGCTGCCGTCGCTGGTGTCGGCGCTGATCACCATGTTTGTGGCGAGCGTGGTGTTCCTCATGAGCAAGCTGTCCTTCACCCGCTCGCTGCTGCTCGCGTACCCGGAGCTGTTCTCGCTGGGCGCGGTCCGCCGCGGACCCTCCGAGGACGCTATACGGAACACCAGGTTCAGGTTCGAGCTGTACGGAGAGGGATGGAGCGGTGACAGCGGATCCCCGCCGGACAAGAAGATGACTGTCAGGGTGTCGGGAGTCAACCCCGGCTACGGCGCCACGGTCCACGCCCTGCTGCACTCCGCCATCACCATACTCAGGCACAGGGACCGGATGCCGGCGCAGACGGGCGTGCTGACTCCCGGGGCGGCGTTCCGGAACACAGACCTCATACAGCGACTCTGCGACCACGGCCTGCTGTTCGAGGTCGTCCGCGACCAGTGA

Protein sequence:

>DPOGS202386-PA
MSRLDLVIFGATGFTGKHAAMEMCHIVKKYPGMSWGVAGRSEGKLNNLLKEVSKKVDEDLSSVKVITAELSDEASLKAMTAQARVLVNCCGPYYLYGEPVVKASIDTKTHYVDVSGEPQFMERMQLVYGSAAREAGVFIISACGFDSIPNDLGVIFLQQNFGGTLNSVESYLSGEVPPEHSGGGVVNYGTWESLVHGMSHHNELPALRKKLYPEKLPTYRPKLKPRFMIHRRGGWCLPFPGSDSSVVFRTQRQLHAEGSRPAQVRTYVRLPSLVSALITMFVASVVFLMSKLSFTRSLLLAYPELFSLGAVRRGPSEDAIRNTRFRFELYGEGWSGDSGSPPDKKMTVRVSGVNPGYGATVHALLHSAITILRHRDRMPAQTGVLTPGAAFRNTDLIQRLCDHGLLFEVVRDQ-
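
Consistency check (illustrative sketch):

The transcript and protein records above should agree with each other: a 413 aa protein plus one stop codon corresponds to 3 x (413 + 1) = 1242 bp, which matches the listed transcript length. The following is a minimal sketch of how that check, and a translation of the coding sequence, could be done. It assumes the standard genetic code and that the trailing "-" in the protein record marks the stop codon; the function name `translate` and its `stop_symbol` parameter are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G codon order.
# "*" marks the three stop codons (TAA, TAG, TGA).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    as in the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check using the figures from the record: 413 residues plus one stop codon.
protein_length_aa = 413
transcript_length_bp = 1242
lengths_agree = 3 * (protein_length_aa + 1) == transcript_length_bp

# First six and last three codons of DPOGS202386-TA, copied from the record.
first_codons = "ATGTCTCGTTTGGATCTC"
last_codons = "GACCAGTGA"
n_terminus = translate(first_codons)
c_terminus = translate(last_codons)
```

Passing the full DPOGS202386-TA sequence to `translate` and comparing the result with the DPOGS202386-PA record would extend the same check to the whole entry; only the two ends are shown here to keep the example short.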