Monarch geneset OGS2.0

DPOGS205816
TranscriptDPOGS205816-TA1206 bp
ProteinDPOGS205816-PA401 aa
Genomic positionDPSCF300081 - 607181-614844
RNAseq coverage0x (Rank: top 97%)
Annotation
HeliconiusHMEL0036033e-2729.00% 
BombyxBGIBMGA014468-TA3e-2931.80% 
Drosophila% 
EBI UniRef50UniRef50_A4KWG02e-0945.71%Reverse transcriptase n=5 Tax=Endopterygota RepID=A4KWG0_OSTNU
NCBI RefSeqXP_003099612.12e-0848.28%hypothetical protein CRE_22920 [Caenorhabditis remanei]
NCBI nr blastpgi|2982043232e-0948.57%endonuclease-reverse transcriptase [Bombyx mori]
NCBI nr blastxgi|2982043659e-0939.51%endonuclease-reverse transcriptase [Bombyx mori]
Group
Gene OntologyGO:00039645.9e-09RNA-directed DNA polymerase activity
GO:00037235.9e-09RNA binding
GO:00062785.9e-09RNA-dependent DNA replication
KEGG pathway 
InterPro domain[22-70] IPR0004775.9e-09Reverse transcriptase
Orthology groupMCL20959 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205816-TA
ATGAAGGATTTTGATTGGAATAACAAAGGCATATTTATTGGTATACAAATGCATACAGAGTTCATTAGTAACCTGCGCTTCGCAGACGACATCATACTCTTCGCGAACACCCCAAAACAGCTCGAAAAGATGCTTGAAGAGTTAAACGGTGCCAGCCAAAGTGTTGGATTACAACTGAATACCTTGAAAACGAAAGTCGCCACAAATAGCTATCAGAGACTTATAACGGCCAGCCGACCTCCCCAACCCTCTAAGCGGGCCCCAAACGCGTCCTCCCGCACAAAATCCTCACCACTACCGGACTCCCTACCAGCCGCCCGTCCCGCCATCCTCGTTGCTATTAAGACGACCATAAAATCCCGCCAGGAGGCCGTCGAGGCCTTCTGTAAGAGCATCTCCTTCAGAGAGTATACCTACGCTCCCGCGAAGGTCTCCCCAGTCTCCAACAACAAGCTAAGAATCGAGTTCGACACCCAGGAGGTCAACGACTGCGACGACGCCCTCAAACGACTCCGAGGCAAGCCAGACGCCGCAGTGACCGCTGTGATTGAAGCCCATGCTCCAATTCAAAGGGATCACCTCGGACATGTCGCCTCAGAATCTGCCATCTGGTACGCCATCTTTAAGATGGGGAGAGTTCGTCTGGATCACCAGCGGGTTCATGCGGAAGAGTTCTCCCCATGCCTGCAGTGCCAGAAATGCCTCAATTTCGGTCATGTTAAAAAACATTGCCGGACTGAGGTTACACGCTGCGCTCACTGCGCTGCTACCAATCACACCCAGGACCACTGCCCAACCAAGGACTCACCCCATGCACCTAAGTGCTACAAGAGCACAGAACGCAACACCAAATTCAATCATAAACACCCCATCACACACAAGGCAACGTCGCAGAGATCCCCCATACTGACTTCCGTCCTTGCTAAAATAAACAATTCACGTTTGTTTTTGTCCGGGACGCTCCTCCAGACTGGTCGGCACCGGCTCCTCGTCTCCCTCCTCCTCCTACTCTCCGAGGACGGCGACGCTCCTCAGTCCTCAGTCCTCGCTCAGTCCGCGCCGAGTCCTCGCTCGGAGCGTCGTGTCGGCCGTCCGTGTGATTATTCGCGGCGGGCGACCGGTCGTGTATCAGGTGTAGTGTCAGTGACGAGGATGTCCATGCGTCTGAAGGCAGTTCGGGGCGGCGCGTGTGGCGCGGCCAGGTGA

Protein sequence:

>DPOGS205816-PA
MKDFDWNNKGIFIGIQMHTEFISNLRFADDIILFANTPKQLEKMLEELNGASQSVGLQLNTLKTKVATNSYQRLITASRPPQPSKRAPNASSRTKSSPLPDSLPAARPAILVAIKTTIKSRQEAVEAFCKSISFREYTYAPAKVSPVSNNKLRIEFDTQEVNDCDDALKRLRGKPDAAVTAVIEAHAPIQRDHLGHVASESAIWYAIFKMGRVRLDHQRVHAEEFSPCLQCQKCLNFGHVKKHCRTEVTRCAHCAATNHTQDHCPTKDSPHAPKCYKSTERNTKFNHKHPITHKATSQRSPILTSVLAKINNSRLFLSGTLLQTGRHRLLVSLLLLLSEDGDAPQSSVLAQSAPSPRSERRVGRPCDYSRRATGRVSGVVSVTRMSMRLKAVRGGACGAAR-