Monarch geneset OGS2.0

DPOGS209116
TranscriptDPOGS209116-TA1869 bp
ProteinDPOGS209116-PA622 aa
Genomic positionDPSCF300483 - 753-2621
RNAseq coverage102x (Rank: top 61%)
Annotation
HeliconiusHMEL0121621e-3829.86% 
BombyxBGIBMGA001404-TA1e-1731.19% 
Drosophila% 
EBI UniRef50UniRef50_Q1ZBP66e-7934.26%Putative uncharacterized protein (Fragment) n=1 Tax=Psychromonas sp. CNPT3 RepID=Q1ZBP6_9GAMM
NCBI RefSeqXP_001599192.12e-7233.33%PREDICTED: similar to pol-like protein [Nasonia vitripennis]
NCBI nr blastpgi|904094372e-7834.26%hypothetical protein PCNPT3_00010 [Psychromonas sp. CNPT3]
NCBI nr blastxgi|904094378e-8034.86%hypothetical protein PCNPT3_00010 [Psychromonas sp. CNPT3]
Group
Gene OntologyGO:00039643.7e-42RNA-directed DNA polymerase activity
GO:00037233.7e-42RNA binding
GO:00062783.7e-42RNA-dependent DNA replication
KEGG pathwayhmg:1002100841e-10 
 K12839 (SMNDC1, SPF30)maps-> Spliceosome
InterPro domain[173-418] IPR0004773.7e-42Reverse transcriptase
Orthology groupMCL24215 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209116-TA
ATGAGTCCAGAGAATTTCACCTCTTATCAGAGAGTGGCAGCCCAGACTATCAGGTTTCTGTCAGAAAAGAAAAGATCTGGTTGGTTTCGCTTCTGTGAAGAACTCTCTCCTAGTACTCCCCCTACTAAGATTTGGAAGAATTTGAGACGTTTTCGGAATTCTGTTTCTGGTGCTATCATTTCATCTAATGACTCCTCAGGCTGGATTGACCAGTTCTCCTTCAAATTAGCCCCTCCTTCAGTTCCTTCGTTGGAAGAACTTTTCCCCCCCTTATCGCATTGTTATTCTTCTGACAAGTTTGAGTCCCCTTTCTCTTGGGAGGAGCTTTCAACAGTCCTTGAGGGTCTCAAGGACTCATCGCCAGGAATAGATGGTATTCCTTATTCTTTTATTTACAATTCTTCTACCCCCACGAAACTTGTTTTTTTATCCATCCTTAATAACATATTTTTAGCCTCCACTCCTCCCGAGGAGTGGAGAACACAATTAATTATTCCCATCTTGAAACATGGAAAGCCCAACTCCGACCCTTCTAGCTACCGACCAATAGCTCTGTCTTGCACTATGGCCAAGATATTAGAGCATTTGATTAAAAATCGATTAGAATGGTTTGTAGAGAGCAAAAAATTATTAGCTAAATCTCAATTTGGCTTCCGCAAAGGATTTGGTACCATCGACAGCTTAAGTATAATTCTGACCGACATCCGCATTGCTCTATCAAAAAATGAATGTGTGGTTGGCGTTTTTTTAGATATTTCTTCAGCTTATGATAATGTTCTTCTTCCAATACTCAGGCAGAAAATGCTCCAGCTGAGTATTCCTGCGAGGTTGCTAAACATTATCCTCAGTCTTCTGTCTTCTAGATCTGTTTCCATTCGCTCTCCTAACTATAATTCTTCTCCTAGACAAGTATGGAAAGGGCTTCCCCAAGGCTCAGTCCTTAGCCCGTTACTCTTTAGTATGTACACATTCGACTTAGAACTCTCAGTCAATCCTTTTTGTGAAGTCCTCCAATATGCCGACGACTTGGCTCTTTATGTCTCCGCAAAGAAAATTGATGAGGCCTCTTCCCGTCTCAACTCAGCTGTAAGCTACCTTCAGGATTGGCTGCATAACCACGGGTTATCTCTATCTATTCCTAAAAGCAAAGTGGTAGTTTTTTCTCGTTTCAGATCTATTCCAGATATCTCTATTTCTTATAGACAACAAAAGTTTATGGTTAAGGATAAAGTCAACTTTCTTGGGTTTACTTTGGACTCGAGGCTAACTGGCATCCAACATATAAATAATATTATGAAAAAATGTGAAAATAATATTAACATTTTGCGTTCTCTTTCTGGTGTTTGGTGGGGCAGCCACCCCTATACTCAAAAAATTTTATACAATGCTATAATACGCAGTCATTTTGATTATGGATCCTTTCTCCTTGTCCCTTGTATTAAATCTGCCTTGTCTATTCTTGATAAAATTCAAGCTAAATGCCTGAGAATAATTTGTGGGGCTATGAAATCATCTCCAATTAACGCTCTTCAGGTAGAATGTGGTGAAGCCCCTCTGCATCTTAGAAGACAATACTTAAGTGACCGTTTCTTTTTAAAAGTCATTCAATTTTCTAATCACCCCCTCATTCCTAAACTGAACTCTCTCTCTGATCTCATTCCTTCTAACAAGTATTGGTCCTATAAAAACTATTCTTGTCTCCTTACTAGTTTAGTCAAATTCCTTCGTCTCCCTTGTCCCGTTCTACAGAACCAAATGTTCCCGCTTTTTGCCATTCCATTTGATGTCCTTAACTTCCATCCTCAAATTTTGAGTTTGGCATTGATAAAGTTTCTGCTATTGCAAACGTTCAATTTCAAAATTACGTAA

Protein sequence:

>DPOGS209116-PA
MSPENFTSYQRVAAQTIRFLSEKKRSGWFRFCEELSPSTPPTKIWKNLRRFRNSVSGAIISSNDSSGWIDQFSFKLAPPSVPSLEELFPPLSHCYSSDKFESPFSWEELSTVLEGLKDSSPGIDGIPYSFIYNSSTPTKLVFLSILNNIFLASTPPEEWRTQLIIPILKHGKPNSDPSSYRPIALSCTMAKILEHLIKNRLEWFVESKKLLAKSQFGFRKGFGTIDSLSIILTDIRIALSKNECVVGVFLDISSAYDNVLLPILRQKMLQLSIPARLLNIILSLLSSRSVSIRSPNYNSSPRQVWKGLPQGSVLSPLLFSMYTFDLELSVNPFCEVLQYADDLALYVSAKKIDEASSRLNSAVSYLQDWLHNHGLSLSIPKSKVVVFSRFRSIPDISISYRQQKFMVKDKVNFLGFTLDSRLTGIQHINNIMKKCENNINILRSLSGVWWGSHPYTQKILYNAIIRSHFDYGSFLLVPCIKSALSILDKIQAKCLRIICGAMKSSPINALQVECGEAPLHLRRQYLSDRFFLKVIQFSNHPLIPKLNSLSDLIPSNKYWSYKNYSCLLTSLVKFLRLPCPVLQNQMFPLFAIPFDVLNFHPQILSLALIKFLLLQTFNFKIT-