Monarch geneset OGS2.0

DPOGS212619
TranscriptDPOGS212619-TA1041 bp
ProteinDPOGS212619-PA346 aa
Genomic positionDPSCF300245 + 169653-173491
RNAseq coverage362x (Rank: top 33%)
Annotation
HeliconiusHMEL0024783e-2290.20% 
BombyxBGIBMGA005213-TA7e-4872.93% 
DrosophilaCG2931-PA1e-5740.90% 
EBI UniRef50UniRef50_D2A2H64e-6041.33%Putative uncharacterized protein GLEAN_07648 n=1 Tax=Tribolium castaneum RepID=D2A2H6_TRICA
NCBI RefSeqXP_968207.18e-6141.33%PREDICTED: similar to poly(A)-binding protein, putative [Tribolium castaneum]
NCBI nr blastpgi|910804232e-5941.33%PREDICTED: similar to poly(A)-binding protein, putative [Tribolium castaneum]
NCBI nr blastxgi|3407180382e-6845.71%PREDICTED: hypothetical protein LOC100647660 [Bombus terrestris]
Group
Gene OntologyGO:00001663.5e-16nucleotide binding
GO:00036761.3e-15nucleic acid binding
KEGG pathway 
InterPro domain[188-275] IPR0126773.5e-16Nucleotide-binding, alpha-beta plait
[214-274] IPR0005041.3e-15RNA recognition motif domain
Orthology groupMCL16314 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212619-TA
ATGGACAACTCCAGCAAACTACAACAGATGCAAGATGAAATGTCCAGATTCGAGGCAGAGATCTCAGGATCGGATCTATTGGCCATGCGCCCAGTTATAGGTGCGGGAACCTTTGGGGCTGTACAGCAACAATTAGAGCGGGCTCTGCCTCATCCTCCAGCGATGATGCCACAGTTCGATGTCATGGGAGCTCGCTTGATGTACCCGACTGTACCACCACCGCCTCCACCACCATCCATGATGGTACCAGCGTCGGTGCAAAGACGTCCCCCTTGTAGTGAAGTATTTCCCCCACATCCATTCATAAGACCCGGGATTCCCGGACCCCAAATACCACCACCACCTCCAAAACCACAGCCGGTAGTGCTCACTGCGGCCCCCAAACTGTACAAACAATCCAAGGAAGACGAAGCCCTGAAAGACCAACTTGATGGCAAGAAAAGGAAACGAGAACGTCCATTGACTCCTCCCCGAGAGCCTGAGATAGAGCCATTACCCCCACCACCAATCGTCCACATAGACACCAGTGCACCAAAGGCCAAAAAAGAGAAAAAACATAGAAAGGTTGTGCGAACAGCCGGTGGTCAGACTTGGGAGGATGTGACCTTACTTGACTGGCCCGACGATGACTTCAGGATGTTCTGCGGAGATCTTGGAAACGACGTCACTGATGAACTACTGACTAGAACATTTGGCAAGTACAGTTCATTTCAAAGAGCAAAAGTTATTAGAGACAAGCGGACGAACAAAAGCAAGGGTTTTGGTTTCGTCAGTTTCAAAGATCCCGGGGACTTCATTAAGGCCATGAAGGAAATGGACGAATCTGTCCAGGAAGTACTGTATGGAGTTCTCTATCCCGGGATCGACCTCGTGGGATTCCTTGAGCTCCAACACCCCCTGGGCCATCGACGCTACGTTGGTAGTAGACCGATAAAGCTAAGGAAGAGCACGTGGAAGAATCGCTCGCTGGATGTAGTGAGGAAGAAAGAGAAAGAGAAGGCTGCTCTGCTCTCACAACTCATGTCAGGAAATAAAAGTTGA

Protein sequence:

>DPOGS212619-PA
MDNSSKLQQMQDEMSRFEAEISGSDLLAMRPVIGAGTFGAVQQQLERALPHPPAMMPQFDVMGARLMYPTVPPPPPPPSMMVPASVQRRPPCSEVFPPHPFIRPGIPGPQIPPPPPKPQPVVLTAAPKLYKQSKEDEALKDQLDGKKRKRERPLTPPREPEIEPLPPPPIVHIDTSAPKAKKEKKHRKVVRTAGGQTWEDVTLLDWPDDDFRMFCGDLGNDVTDELLTRTFGKYSSFQRAKVIRDKRTNKSKGFGFVSFKDPGDFIKAMKEMDESVQEVLYGVLYPGIDLVGFLELQHPLGHRRYVGSRPIKLRKSTWKNRSLDVVRKKEKEKAALLSQLMSGNKS-