Monarch geneset OGS2.0

DPOGS205140
TranscriptDPOGS205140-TA2178 bp
ProteinDPOGS205140-PA725 aa
Genomic positionDPSCF300246 + 7124-13999
RNAseq coverage2x (Rank: top 91%)
Annotation
HeliconiusHMEL0107282e-0727.43% 
BombyxBGIBMGA008617-TA2e-2556.07% 
Drosophila% 
EBI UniRef50UniRef50_Q0N4122e-1642.57%P87/VP80 n=1 Tax=Clanis bilineata nucleopolyhedrosis virus RepID=Q0N412_9ABAC
NCBI RefSeqXP_002023625.14e-1740.00%GL19903 [Drosophila persimilis]
NCBI nr blastpgi|1131954906e-1642.57%P87/VP80 [Clanis bilineata nucleopolyhedrosis virus]
NCBI nr blastxgi|2099788521e-1437.01%hypothetical protein [Adoxophyes orana nucleopolyhedrovirus]
Group
Gene OntologyGO:00039644.9e-09RNA-directed DNA polymerase activity
GO:00037234.9e-09RNA binding
GO:00062784.9e-09RNA-dependent DNA replication
KEGG pathway 
InterPro domain[250-364] IPR0004774.9e-09Reverse transcriptase
Orthology groupMCL23296 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205140-TA
ATGTCAAAAAGTGCTCTTATTAGAAATCAGATCGTGCATCATGTATCCAACAACTGGCAGAGGTTCAAATATTTTACTCAGCAAGAATCTAGTGAACCTTATGGTACTAAACGTTTGTACTTCGAGGACATGTCGAAGCCTTATACACAAGGATCTCTTTGTGAAGTTAAAGCGGCGGCAGAATTGTTTCCATATCAGTTTCAAGTATATCAAGATGGCTGTCTCAGGGCTACATTTGGTGAAGCATCAGATGGAATTAAAAGACTCAGGTTTCCTGGAAACCACAACAACGGTCATTATGACGTTTTAGTTCCATTATTTGAACCAATAAGTATTGATATTACTCAACCATCTAACAATCGTCATCAAAATCTTGAAACAGAGACCGAAGAAGCAACAAATTTGTCTTCCTCTCAATTAACTTCTGACACAACTTTCTTGAGTAGTTATGGATCTGCTTTTGTAACTGAAATATCAATACGATCAAAAAGTGGAACAAAACGAAGAAAAAGATTTTCTAGAGCAATTAGAGCTAAACAACTGAGAAAAGCTGCTTCAAAATATACAAAAAACCATAGAGAAGTTCATAGAGAGGCCGCTAGAAAATATAATGAAAGTCATCCTGAAATCTTAAAACAAACCACCCAAACGTATCGTCAAACTCATCCTGATGTAGTGAAAAAAATTCACAAGGCTTACAAGGAAAAAACTCCACACGTCAATCGAGCAGCTTCTGCCAAATATGATGAAAAGAGATCAAAATCTAAATTATTGTCCTGGACTTCTAAACATTTGTCCGGTTTCAAGTACGATAGTAGAACAAATTATTCGAATGATAAAATTGTCAATCTTGGTCAACGTTCTCCTTGCCAATGGTGTCATGCTCTGAAATGGATTGATGAGACTCAAGGAATGTGCTGCAGTAGTGTTAAATCAAAGCATTTAAGCCACCTACGTTTTGCCGACGACTTGGTACTGATATCAGAGACAGTAAAAGAACTACAACACATGGTAGAGTTACTAAACCAAGCGAGCAAGGTCGCCGGGCTAGAAATGAACCTCACTAAAACTATGGTAATGACCAACAACATCAAGATAAATGATAATATCTCTACTTTGTCGACATTAAAGGAGAAATTTGGAGAAGATGATGTTTTGAGTTTGACTAGTTCAAAAGAGCGCTGTGATTCCGACCTAGAAAGTAAAGCTTCATCATCAGTATCATTGCAAAAGTCAAAACTTAATCAATTAACTCTGGATTCATGTGTTAATAATATTAAAACATTTAGTGAAAAAGTTGTAGATGTTGTTACTGATAACGGTGCAAATATCGTCAAAGCCGTAACAATCGCATTTGGCAAACAAAAACACTTATGGTGTTTCGCCCACACCTTAAATTTGGTAGCTCAAAAACCTTTTGACGAGAAAGGTGGTATTGAAAATACCCAACAGTTACTTAACATTGTAAAAGACATAACTAGATATTGTAAACAAAATCCAAATGTTGCTGATGCTCTTCCCAAAGCGCAAAATGATGTGGCAGTACCACTGAAATTGATTCAAGGTGAATCAGGATTCCGAGCGGCAGGCATTGTCCCATATAACCCCGACCATTTCACTGAAGAAGATTTTATCGCAGCTGAAGTGTTAGCTCAACCTGCTGTTATAGTTCAAGATTCATCCATGGTTATACCTTCATCTACTTCAGTTAATCCAATGACAGCCACATCATCTAATGCAATTATTCCAGCGCCATCGACATCATCTGCTACCATAGATCCTGTGCCATTGACATCATCTGCTACTATAGATACAAGGCTGCTTACATCTCCCACAGTAATGAATGCAACGCCATCCACTTCGTCCGGAAATATACAATCAAATATCGCAAATCTGCTTCCATTTCCGACCCAAACAACTCGTGAAAGACTCGTAAAGAAAGGACAAATGAAGCAACGCGCAAAGATTTTAACATCAACACCCGTTAGAGATAGTCTCATAGAAAAAGAAAATAAGAAAGCCACAAAAGCAGCAAAAACTGTAAAAACTTCTAAACCATCAAAAGACAAAAATGCAGCACCGAAAGCTAAAGGAAAACAGAAAAAAAAGTCGCAAATAAGGCCAAAAGACGCGTGCTGCAGGAGTCTAACAGCTCTTCGGATCAGACATCAGTCTTAA

Protein sequence:

>DPOGS205140-PA
MSKSALIRNQIVHHVSNNWQRFKYFTQQESSEPYGTKRLYFEDMSKPYTQGSLCEVKAAAELFPYQFQVYQDGCLRATFGEASDGIKRLRFPGNHNNGHYDVLVPLFEPISIDITQPSNNRHQNLETETEEATNLSSSQLTSDTTFLSSYGSAFVTEISIRSKSGTKRRKRFSRAIRAKQLRKAASKYTKNHREVHREAARKYNESHPEILKQTTQTYRQTHPDVVKKIHKAYKEKTPHVNRAASAKYDEKRSKSKLLSWTSKHLSGFKYDSRTNYSNDKIVNLGQRSPCQWCHALKWIDETQGMCCSSVKSKHLSHLRFADDLVLISETVKELQHMVELLNQASKVAGLEMNLTKTMVMTNNIKINDNISTLSTLKEKFGEDDVLSLTSSKERCDSDLESKASSSVSLQKSKLNQLTLDSCVNNIKTFSEKVVDVVTDNGANIVKAVTIAFGKQKHLWCFAHTLNLVAQKPFDEKGGIENTQQLLNIVKDITRYCKQNPNVADALPKAQNDVAVPLKLIQGESGFRAAGIVPYNPDHFTEEDFIAAEVLAQPAVIVQDSSMVIPSSTSVNPMTATSSNAIIPAPSTSSATIDPVPLTSSATIDTRLLTSPTVMNATPSTSSGNIQSNIANLLPFPTQTTRERLVKKGQMKQRAKILTSTPVRDSLIEKENKKATKAAKTVKTSKPSKDKNAAPKAKGKQKKKSQIRPKDACCRSLTALRIRHQS-