Monarch geneset OGS2.0

DPOGS208126
Transcript: DPOGS208126-TA, 1173 bp
Protein: DPOGS208126-PA, 390 aa
Genomic position: DPSCF300154 + 82428-85022
RNAseq coverage: 7x (Rank: top 86%)
Annotation
Heliconius: HMEL014373, 7e-12, 27.68%
Bombyx: BGIBMGA012927-TA, 3e-13, 32.81%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_D7EL11, 1e-28, 38.61%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D7EL11_TRICA
NCBI RefSeq: XP_001947960.1, 3e-37, 46.25%, PREDICTED: similar to polyprotein [Acyrthosiphon pisum]
NCBI nr blastp: gi|270016889, 2e-32, 43.79%, hypothetical protein TcasGA2_TC006964 [Tribolium castaneum]
NCBI nr blastx: gi|270016889, 8e-45, 43.79%, hypothetical protein TcasGA2_TC006964 [Tribolium castaneum]
Group
Gene Ontology:
    GO:0003964, 1.6e-21, RNA-directed DNA polymerase activity
    GO:0003723, 1.6e-21, RNA binding
    GO:0006278, 1.6e-21, RNA-dependent DNA replication
KEGG pathway: hmg:100208869, 8e-15
 K03010 (RPB2) maps-> Huntington's disease
    Purine metabolism
    Pyrimidine metabolism
    RNA polymerase
InterPro domain: [187-341] IPR000477, 1.6e-21, Reverse transcriptase
Orthology group 

Nucleotide sequence:

>DPOGS208126-TA
ATGAATGTACGCGGCGCGTGTGAAGTCGGTAACGCGGTTCGCGGGATACTGACAGCCTCCGTTCTCCGAGGGGCGAGCCTGCGCGGTCCAAATCCGTCTAGAAATACACAGCACGCAGAAATAGTTCAGCAGCTACTGAGGAGCTACGTTGAGCAGACGATAGCCCCAGAGAAAGACCCCATCGTCCTCTCCCCTGGCCAAGTGCAGCGTATGATTCGCCGCACTAAGCTTCGAAAGGCCCCGGTTCTGACAGAATCACTAACGAAGTTCTCCGTCACATTTCTGCGCGTAGCACAGCAGCGGCGACGCGCTTACATAATAGAGTCCTCAGGACCGGCCACTTCCCTATCCAATGGAGGCTCGGCCGAGTCATCATGCTGCCGAAGCCTGGGAAGAACATCTTGCTGCCAGGGAGCTGTCGCCCCATCACGCTGCAGTCAACCGTCTCGAAGGTTTTCGAGAAACTTCTGCTGCTGCACCTCACACCACCCGTCCCACCACGCAATGAACAGTTTGCCTTCAGAGTTGAGCACTCAACCACGCTCCACAGCGCTCGCGAGGGTCCTGCACGTCCTCTCGGTGGCTCTCGACAAAAACGAGTCAGCCGTCGCAGTGACGCTCGATATGGGGAAAGCTTTCGATCGCGTATGGCACCCCGGCCTATTGTACAAGCTGGCCACATTCACTACTCCTCGCCGAATAGTCAGGATTATGGCCACCTTCCTGCGGGACAGGCCTTTCCAGGTGTCGGTGAAACCCACCCCATCCACGGAACGCCCCATCAGAGCCGGAGTACCGCAGGGGAGCTGCCTGTTTCCGATATGTCACTCCAACTATACGGATGACATCCCAGTGGCGGAACGTGCGACCTTAGCACTCTACGCCGACGACGCTGCCTACATTACAACATCTCTGACTCCCGCCCACGCGGCGATGAAAATGCAGCGTCATATTGACCAGCTCCCTCAGTGGCTGGATAAATGGCGTCTCAAAGTGAAGGTCTCGAAGACCCAGGCGATTTCCATGGGTCGGCGACACCTACCGCAGCTGGCTGGCTGGCTCGAGCGGGGTAGCGGCGAGAAGAGCCGAAAAGAAGAGAGACCACAAGAAGATGACCCCATCGGACGCCGGCTGGATTCTAACCACCGGGGACGTGTCCTCGCCCACAGTTAG

Protein sequence:

>DPOGS208126-PA
MNVRGACEVGNAVRGILTASVLRGASLRGPNPSRNTQHAEIVQQLLRSYVEQTIAPEKDPIVLSPGQVQRMIRRTKLRKAPVLTESLTKFSVTFLRVAQQRRRAYIIESSGPATSLSNGGSAESSCCRSLGRTSCCQGAVAPSRCSQPSRRFSRNFCCCTSHHPSHHAMNSLPSELSTQPRSTALARVLHVLSVALDKNESAVAVTLDMGKAFDRVWHPGLLYKLATFTTPRRIVRIMATFLRDRPFQVSVKPTPSTERPIRAGVPQGSCLFPICHSNYTDDIPVAERATLALYADDAAYITTSLTPAHAAMKMQRHIDQLPQWLDKWRLKVKVSKTQAISMGRRHLPQLAGWLERGSGEKSRKEERPQEDDPIGRRLDSNHRGRVLAHS-
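
The record lists a 1173 bp transcript and a 390 aa protein. These figures are consistent if the transcript is a complete coding sequence: 1173 / 3 = 391 codons, the last of which is the stop codon (shown as "-" at the end of the protein sequence). A minimal sketch of that check follows. It assumes the standard genetic code's stop codons, and the function name check_cds and the toy sequence are hypothetical, not part of the record. It does not look for internal stop codons.

```python
# Stop codons of the standard genetic code (assumed to apply here).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_cds(cds: str) -> int:
    """Return the number of amino acids encoded by a complete CDS.

    The terminal stop codon is not counted. Raises ValueError if the
    sequence does not look like a complete coding sequence.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    if not cds.startswith("ATG"):
        raise ValueError("missing ATG start codon")
    if cds[-3:] not in STOP_CODONS:
        raise ValueError("missing terminal stop codon")
    return len(cds) // 3 - 1


# Toy CDS with the same start and stop codons as DPOGS208126-TA
# (ATG ... TAG). This is NOT the real sequence.
toy = "ATGAATGTACGCTAG"
print(check_cds(toy))  # 5 codons, one of them the stop codon

# The lengths stated in the record: 1173 bp should give 390 aa.
print(1173 // 3 - 1)
```

To check the actual record, pass the full DPOGS208126-TA sequence to check_cds and compare the result with the stated protein length.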