Monarch geneset OGS2.0

DPOGS201207
Transcript: DPOGS201207-TA, 1509 bp
Protein: DPOGS201207-PA, 502 aa
Genomic position: DPSCF300262, + strand, 677114-682613
RNAseq coverage: 0x (Rank: top 95%)
Annotation
Heliconius: HMEL022595, 1e-10, 26.91%
Bombyx: BGIBMGA011035-TA, 3e-16, 35.92%
Drosophila: no hit listed
EBI UniRef50: UniRef50_P21328, 6e-29, 34.29%, RNA-directed DNA polymerase from mobile element jockey n=8 Tax=Drosophiliti RepID=RTJK_DROME
NCBI RefSeq: XP_001948682.1, 3e-42, 34.78%, PREDICTED: similar to reverse transcriptase homolog [Acyrthosiphon pisum]
NCBI nr blastp: gi|326578846, 9e-42, 56.35%, reverse transcriptase [Quentalia chromana]
NCBI nr blastx: gi|326578846, 5e-40, 55.37%, reverse transcriptase [Quentalia chromana]
Group
Gene Ontology: GO:0003964, 1.1e-14, RNA-directed DNA polymerase activity
Gene Ontology: GO:0003723, 1.1e-14, RNA binding
Gene Ontology: GO:0006278, 1.1e-14, RNA-dependent DNA replication
KEGG pathway:
InterPro domain: [370-481] IPR000477, 1.1e-14, Reverse transcriptase
Orthology group: MCL10172, Insect specific

Nucleotide sequence:

>DPOGS201207-TA
ATGGAAACAGTACCAGTTATATGCTCAGGAAATACAATGATTAGAACCATAGTTGATAATTCAGAGTTCGACATTACAGTCAAGGACGTATTATGTGTTCCTAACCTATCAGCAAATTTGTTATCAGTGAGTCAGCTTATTCAAAATGGAAACAAAGTCAGTTTTGAAGAAGAAGTTTGCTATATACGCAATAGACAGAATATACTGATTGGAAAAGCAAATTTGGTGAATGGAGTGTACAAACTAAATGTAAAGTTAGAAAGTTTGGTGGCGTGCTCAGCTACAACAATAACAAGTGAGATTTGGCACCGTAGACTAGGACACATTAATAGCAGGGACCTGAATGCTATGAGGAATGGCGCTGTAGATGGAATATCATACATTGGTAAAGCGGAAATCGACAGATCTAACTGCATCACATGTTGTGAAGGCCAAGCTAAAGATGCTCATGATTTCTTGTCATTTACAAGATACATAGTTCAGACTTATTTGATACTTGGCAAATCTTCTACATCCATACCAAAACGAATTGGAGGCAAGACACCGGCAACTACTAAAAGTAAACTGCCGGATTCAGTAAGACTCGATGGGAAGGACCATTATATTATTAGAAATGAAACTCAAATTCGATGCCGCGAATGTCACAAGAATACAAAATTCAAATGCAAACAACCGATCCGGCCCTTGCTCGCGCGAGATGGCACTCCACGCTACCGCGCGGCGGACCGAGCAGAGATCTTCGCCGAACACCTGGAGACCCAATTTCAGCCGAATCCCTCTAGAAAAACGCAGCACGCAGAAAAAGTACAGAATACGATTCGCCGCAGAAAGCTTCAGAAGGCCCCCGGTCCTGATGGAATCACTAATGAAACCCTCCGCCATCTTCCCTCGCGTGGCATAGCAGCCGTGACGCGCTTGTATAATGGAGTCCTCAGGACCGGCTACTTAACTACCCAATGGAAGCTCGGAAGAGTCATCATGCTACCGAAGCCAGGAAAGAACATCTTGTTGCCAGGGAGCTGTCGCCCCATCCACGCTCCTGTCAACCGTCTCGAAGGTTTTCGAGAAACTGCTACTGCTGCACCTGATACCACACATCCAACCACGCGACGAACAGTTTGGCTTCAGAGCGGAGCACTCAACCACGCTTCAGCTCGCGAGGGTCCTGCACGTCCTCTCGGTAGCTCTCAACAAGAACGAGTCAGCCGTCGCCTCGACATGGAGAAAGCTTTCGATCGCGTATGGCACCCCGGCCTATTGTACAAGCTGGCCACATCCACTACTCCTCGCCGAATAGTAAGGATTATGGCCACCTTGCTGCAAGACAGGCGTTTCCAGGTGTCGGTGGAAGGCACCCTCTCCACGCAACGCCCCATTAGAGCCGGAGTGCCGCAGAGGAGCTGCCTGTCTCCGATATGTTACTCCAAGTATACGGACGACATCCCAGTGGCGGAAGGTGAGACCTTAGCGCCTACGCCGACGATGCTGCCTACATTACAACATCTCTGA

Protein sequence:

>DPOGS201207-PA
METVPVICSGNTMIRTIVDNSEFDITVKDVLCVPNLSANLLSVSQLIQNGNKVSFEEEVCYIRNRQNILIGKANLVNGVYKLNVKLESLVACSATTITSEIWHRRLGHINSRDLNAMRNGAVDGISYIGKAEIDRSNCITCCEGQAKDAHDFLSFTRYIVQTYLILGKSSTSIPKRIGGKTPATTKSKLPDSVRLDGKDHYIIRNETQIRCRECHKNTKFKCKQPIRPLLARDGTPRYRAADRAEIFAEHLETQFQPNPSRKTQHAEKVQNTIRRRKLQKAPGPDGITNETLRHLPSRGIAAVTRLYNGVLRTGYLTTQWKLGRVIMLPKPGKNILLPGSCRPIHAPVNRLEGFRETATAAPDTTHPTTRRTVWLQSGALNHASAREGPARPLGSSQQERVSRRLDMEKAFDRVWHPGLLYKLATSTTPRRIVRIMATLLQDRRFQVSVEGTLSTQRPIRAGVPQRSCLSPICYSKYTDDIPVAEGETLAPTPTMLPTLQHL-