Monarch geneset OGS2.0

DPOGS210626
Transcript: DPOGS210626-TA (1821 bp)
Protein: DPOGS210626-PA (606 aa)
Genomic position: DPSCF300168 + 454043-456290
RNAseq coverage: 3x (Rank: top 90%)
Annotation
Heliconius: HMEL022595; 2e-07; 22.49%
Bombyx: BGIBMGA013961-TA; 6e-20; 25.48%
Drosophila: (no entry)
EBI UniRef50: UniRef50_Q8MY33; 5e-122; 44.27%; Reverse transcriptase n=9 Tax=Endopterygota RepID=Q8MY33_9NEOP
NCBI RefSeq: XP_001949771.1; 3e-75; 34.04%; PREDICTED: similar to Putative 115 kDa protein in type-1 retrotransposable element R1DM (Putative 115 kDa protein in type I retrotransposable element R1DM) (ORF 2) [Acyrthosiphon pisum]
NCBI nr blastp: gi|22004004; 1e-122; 44.39%; reverse transcrpitase [Papilio xuthus]
NCBI nr blastx: gi|22004004; 7e-129; 44.46%; reverse transcrpitase [Papilio xuthus]
Group
Gene Ontology: GO:0003964; 8.7e-05; RNA-directed DNA polymerase activity
Gene Ontology: GO:0003723; 8.7e-05; RNA binding
Gene Ontology: GO:0006278; 8.7e-05; RNA-dependent DNA replication
KEGG pathway: (no entry)
InterPro domain: [42-251] IPR005135; 2.7e-42; Endonuclease/exonuclease/phosphatase
Orthology group: MCL10019; Insect specific

Nucleotide sequence:

>DPOGS210626-TA
ATGGCCAATGTCCGAATTTCCGGCCTGGACGACTCAGTAACCCCGGAGGAAATTCGTCTGGCGCTGGCAGAAAAAACTGGAGTCTCCCCGGAAGACTTCAAAGTCGGGCTTATTACCCACGGGCCCATAGGTAAAGTTCTTCAGGGGAACTTAAACCACGCGGTCGCAGCACAGGACCTCTTGTGCCAGACTGTGGCCGAGTGGAACATAAATGTAGCTGTCATTGCGGAGCCATACTCTATACCCCGAACCCATAAATGGGCCGGGTCAGTGGATGGTTCCGCGGCTATTTTCTTTCCCGGCGTGGCCTGCACTCACTCCGTTGTGGAGAGAGGAGCGGGCTTTGTGGCAGCTCGATGGGGAGAAGTAGTGGTGGTCTCTACATACTTCTCCCCAAACCGCAGCCGGGCCGACTTTGAGTCGTTCCTGGCTACGGTTGAAGGAGTCATCCTTCGGGTGGCCCCCAGTCCGGTGCTGGTGGCTGGGGACCTCAATGCGTGGTCTCGCGCTTGGGGCTCTATCAGAACTAACGCCCGCGGTCGTGTCCTGGAGTCCTGGGTTCTGTCATTGGGACTCCAGATTCTCAATAGAGGCACCACCCCAACCTGCGTCCGGTGGCAAGGCACATCCATAGTGGACGTGACCTTTGCCACCCCATCACTTGCCGCCCGCATCAGGGACTGGCGGGTTATGGCGGAAGCGGTGACCCTTTCGGACCATCGGTACGTCCGATATGAGGTCTCCCCGACGTCCCCTGGGACACCTTTCCAGCTGGGTACCAGACCACCTTTCCCAAGGTGGTCACTTGTTCGCCTCCAACCCGATGTGGCTGAGGAGGCAGCGATGGTGAGAGCATGGGCCGAAGTGCCCGACACTATCGCTGGGGATGCGGACAGCATGGCGGACCTCTTTGCGGATGACATTAAGGTTGTCTGCAACGCCGCCATGCCGAAGACGCAGGCCTGCCCACGAAACAGGGGGAAAGTGTATTGGTGGACGCAGGAACTGTCCAGCCTGCGTACCGCCAGTATGGGAGCCAGACGCGCCTACCAGCGTTGCCGTAGGCGCGCCCGAGGAACGCCCGTTGTAGAGGAAGCTCTATACCGGGCCTACCAGGACGCCAACAAGGCATTGCGGACGGCCATTCGCAAGGCCAAAGAGGATGCCTGGGACCAGTTTCTGGGTATCCTCAATAACGACCCTTGGGGTAGGCCCTACAGGGCGATTAGGGGGAAGTTGTCTACTGCAGCTTCCCTTACCTCCTGTATGGAGCCTGGGCTGCTGCGGAGGGTACTGGGGACGTTGTTCCCTGATCCGGGACCTTTCGCACCTCCGTACATGGCAACCGCTGCTCTCGCTCAGGGAGAGCGGGTTGACGGCCCTCCTGTGTCGGACGTGGAATTCAATACGATCCGTTCAAGACTCCGTCACAAACGCAAGGCGCCGGGGCCGGACGGGGCCCCCTCTAAGGTGATGGATATCGCCTTGGGACCCCTGGAGGACCGGTTTCGAGCAGTGCTCGATACCTGCATGGCGGCGGCCCGCTTCCCCAGGCGATGGAGAGTAGGGCGGCTGTGCCTAATCCGTAAGGAGAGCCGTCCGGCGGATGCCCCAGAGGGATACCGGCCAGTGGTGCTACTGGATGAGGCGGGGAAGGCTTTCGAAAAGATTCTCGCCTCCCGCATCATCCAGCACCTGGAAGGTAGAGGACCCGACCTGGCGGAATGCCAGTACGGCTTCCGTACTGGTCGGTCTACGATCGACGCGGTGACCCGCCTAAAGAGGTGGACTGAGGCGGCTTTCAACAGGGGGAGGTGGTGA

Protein sequence:

>DPOGS210626-PA
MANVRISGLDDSVTPEEIRLALAEKTGVSPEDFKVGLITHGPIGKVLQGNLNHAVAAQDLLCQTVAEWNINVAVIAEPYSIPRTHKWAGSVDGSAAIFFPGVACTHSVVERGAGFVAARWGEVVVVSTYFSPNRSRADFESFLATVEGVILRVAPSPVLVAGDLNAWSRAWGSIRTNARGRVLESWVLSLGLQILNRGTTPTCVRWQGTSIVDVTFATPSLAARIRDWRVMAEAVTLSDHRYVRYEVSPTSPGTPFQLGTRPPFPRWSLVRLQPDVAEEAAMVRAWAEVPDTIAGDADSMADLFADDIKVVCNAAMPKTQACPRNRGKVYWWTQELSSLRTASMGARRAYQRCRRRARGTPVVEEALYRAYQDANKALRTAIRKAKEDAWDQFLGILNNDPWGRPYRAIRGKLSTAASLTSCMEPGLLRRVLGTLFPDPGPFAPPYMATAALAQGERVDGPPVSDVEFNTIRSRLRHKRKAPGPDGAPSKVMDIALGPLEDRFRAVLDTCMAAARFPRRWRVGRLCLIRKESRPADAPEGYRPVVLLDEAGKAFEKILASRIIQHLEGRGPDLAECQYGFRTGRSTIDAVTRLKRWTEAAFNRGRW-
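
The lengths reported in this record can be cross-checked against the two sequences. The Python sketch below is illustrative only and not part of the database. The helper name `check_cds` is hypothetical, and it assumes that the transcript is a complete coding sequence and that the trailing `-` in the protein sequence marks the stop codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of the standard genetic code


def check_cds(cds, protein):
    """Return a list of inconsistencies between a CDS and its protein (empty if none)."""
    cds = "".join(cds.split()).upper()      # drop whitespace and line breaks
    protein = protein.strip().rstrip("-*")  # assumption: trailing '-' or '*' marks the stop
    problems = []
    if len(cds) % 3 != 0:
        problems.append("CDS length is not a multiple of 3")
    if not cds.startswith("ATG"):
        problems.append("CDS does not start with ATG")
    if cds[-3:] not in STOP_CODONS:
        problems.append("CDS does not end with a stop codon")
    if len(cds) // 3 - 1 != len(protein):
        problems.append("protein length does not match codon count minus the stop")
    return problems


# Lengths reported in the record: 1821 bp transcript, 606 aa protein.
# 606 codons plus one stop codon account for all 1821 bases.
assert 1821 == 3 * (606 + 1)

# Genomic interval on scaffold DPSCF300168, taking both coordinates as inclusive.
genomic_span = 456290 - 454043 + 1  # 2248 bp
```

Calling `check_cds` with the DPOGS210626-TA and DPOGS210626-PA sequences pasted in as strings should return an empty list if the record is internally consistent. The genomic interval is longer than the transcript, which would be consistent with intronic sequence, although the record itself does not list the exon structure.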