Monarch geneset OGS2.0

DPOGS215280
TranscriptDPOGS215280-TA1089 bp
ProteinDPOGS215280-PA186 aa
Genomic positionDPSCF300047 + 657991-665351
RNAseq coverage0x (Rank: top 99%)
Annotation
HeliconiusHMEL0028599e-1230.77% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_D7EKE86e-2539.69%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D7EKE8_TRICA
NCBI RefSeqXP_001948286.18e-2458.02%PREDICTED: similar to blastopia polyprotein [Acyrthosiphon pisum]
NCBI nr blastpgi|2700166602e-2439.69%hypothetical protein TcasGA2_TC016329 [Tribolium castaneum]
NCBI nr blastxgi|2700166603e-3834.91%hypothetical protein TcasGA2_TC016329 [Tribolium castaneum]
Group
Gene OntologyGO:00039645.7e-06RNA-directed DNA polymerase activity
GO:00037235.7e-06RNA binding
GO:00062785.7e-06RNA-dependent DNA replication
KEGG pathway 
InterPro domain[41-114] IPR0004775.7e-06Reverse transcriptase
Orthology groupMCL11503 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215280-TA
ATGTCTGAAGCGAAGATGCAGAAAGGTAATCTAGTTTATGATGAAACTTCAAACACACGCCTCACAGGCATAGCTGTCAAAGAAGTTATTAGTAGTAGATTCCCGTTACCCCTCATCGAAGAACAGATTGACCGATTAAGAAATTGCAAAGTATTTTCTCCGTTAGACTCAGAGAATGGATTTTTACACGTGAATGTTCACAAGGATAGTCGCAAGTATACATCATTTGTTACACCACGAAGCCAATACGAATTCCTGAAAATGCCGTTGGGTCTATGTACAGCACCATCGGTATTTCAACCATTTATAGAGAAAGTTTTCAAAAGAGCTCAAGAACCGAAGGCGTTATGCAGTTATCCTGTACTTCGCATTTACAAACCCGAGCTTGAGACCGAGTTATACACAGACGCAAGAGCAGATGGATATCGAAATGTGTTGTTACAAAGATGTCTGAAAGAACCACCTGCACCCCGTCTACTATACGAGGAAAAAGACTACACAGGCCGAGAGAAATTATTCCAGTTATTTTCTGAGGATATTGGCGATTGCAGTGTCAGTTAAAAAGTTGCGTATGTACCTACTGAGTATAAATTTAAGATCGTTACAGATTGCGCTGCTTTAGCGAAAACTTTAACTAAGAAAGACTTACCTCCTCGTTGGGCTCTTTTACTGGAAGAATACGACTTTACAATCGAATATAGATCTGGAACTAGAACGAAGCATGCAGATACCATAAGCAGATATCCTATTAGTTTGCAATACTTACTGAAACATTTTTACGATGCTCACATTTCTGTTCAAAAACAAAATCTTTTAGAGAATTTGAAAGGATTCCACGTTACTGTTTTGCATAATGGCTACGACCGAGTTACCGCCAAGGGATACAAATATATTGCAAACGAATGCAAATCGATGACTGTTGAGATAGAGTTCTCTTTAGCTCTTCCGAAAAAAGGGATCATATTGTACAATGATAAATTGGGACATGTTCTTCCCTTGATCATTTTGAATAATTGTTCATACTTTCTATTCGCACTGCAAGGATTAGTTACTAATGCCTTCATTGAAGGTATGTGTGTTGTATTTTAA

Protein sequence:

>DPOGS215280-PA
MSEAKMQKGNLVYDETSNTRLTGIAVKEVISSRFPLPLIEEQIDRLRNCKVFSPLDSENGFLHVNVHKDSRKYTSFVTPRSQYEFLKMPLGLCTAPSVFQPFIEKVFKRAQEPKALCSYPVLRIYKPELETELYTDARADGYRNVLLQRCLKEPPAPRLLYEEKDYTGREKLFQLFSEDIGDCSVS-