Monarch geneset OGS2.0

DPOGS208739
TranscriptDPOGS208739-TA1101 bp
ProteinDPOGS208739-PA366 aa
Genomic positionDPSCF300043 + 312893-314923
RNAseq coverage2469x (Rank: top 5%)
Annotation
HeliconiusHMEL0152438e-14374.23% 
BombyxBGIBMGA003401-TA1e-17891.20% 
Drosophilapum-PA1e-15479.10% 
EBI UniRef50UniRef50_D7EK093e-16981.53%Pumilio n=2 Tax=Tribolium castaneum RepID=D7EK09_TRICA
NCBI RefSeqXP_967865.22e-16981.53%PREDICTED: similar to pumilio [Tribolium castaneum]
NCBI nr blastpgi|2894492297e-17589.17%pumilio [Bombyx mori]
NCBI nr blastxgi|2894492295e-17690.57%pumilio [Bombyx mori]
Group
Gene OntologyGO:00054881.2e-135binding
GO:00037231.8e-09RNA binding
KEGG pathway 
InterPro domain[47-350] IPR0119891.2e-135Armadillo-like helical
[48-346] IPR0160245.8e-111Armadillo-type fold
[139-173] IPR0013131.8e-09Pumilio RNA-binding repeat
Orthology groupMCL11292 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208739-TA
ATGACTCCGCATCTCACAGAACGTGTCGACAAGCTGTCGGAGAGTTGCAGAAGATACAAGAACCAGATACGTCTCCTGGTGAACAAGCTGAAGGAGGCTGGGATTGAAGACGTTCAAGATATACTTGAAGGAAATGGTGAGTTTATACAACAGAAGTTGGAGAGAGCAACGGTTCAAGAGAAACAGATGGTGTTTAATGAAATTATCGGCGCCGCGTACAGTCTCATGACGGACGTGTTCGGCAACTACGTCATACAGAAGTTCTTCGAGTTCGGCACGACGGAACAGAAGACGACGCTAGCTCAAAAGGTGCGCGGACACGTGCTGGCGCTGGCGCTGCAGATGTACGGCTGTCGGGTCATACAGAAGGCGCTGGAGTCCATACCGCCCGAACAGCAACAGGAAGTGGTCAGGGAGCTGGACGGACACGTGCTCAAGTGTGTCAAGGACCAGAACGGCAACCATGTTGTACAAAAATGTATAGAGTGCGTCGAACCGGCGGCGTTACAGTTTATAATAAACGCGTTTGCCGGTCAAGTGTACGCGCTGTCGACTCACCCCTACGGCTGCAGAGTGATACAACGCATCCTGGAGCACTGCACGGCCGAGCAGACCGCGCCCGTCCTGGCCGAGCTACACGCACACACCGACCAGCTCATACAGGACCAGTACGGTAACTACGTGGTACAACATGTGCTGGAGCACGGCGCGGCGGAGGACAGGTCGCGGCTGGTGGCCGGCGTGCGGGGGAAGGTGCTGCAGCTGTCACAGCACAAGTTCGCCTCCAACGTGGTCGAGAAGTGTGTGACGCACGCCACGCGCAACGAACGCGCTCTGCTCATCGACGAGCTGTGCGGGTTTAACGATAACGCTCTCCACGTCATGATGAAGGACCAGTACGCCAACTACGTGGTCCAGAAAATGATCGACGTGGCGGAGCCGACGCAGCGGAAGGTGCTGATGCACAAGATCCGACCTCACATCGGCTCGCTGCGCAAGTACACGTACGGAAAACACATCATCGCCAAGCTGGAGAAGTTCTTCATGAAGGCGCCGGAGCTGGGCCCCATCGGCCCGCCGCCGCCCAACCCGGTCCTGTAG

Protein sequence:

>DPOGS208739-PA
MTPHLTERVDKLSESCRRYKNQIRLLVNKLKEAGIEDVQDILEGNGEFIQQKLERATVQEKQMVFNEIIGAAYSLMTDVFGNYVIQKFFEFGTTEQKTTLAQKVRGHVLALALQMYGCRVIQKALESIPPEQQQEVVRELDGHVLKCVKDQNGNHVVQKCIECVEPAALQFIINAFAGQVYALSTHPYGCRVIQRILEHCTAEQTAPVLAELHAHTDQLIQDQYGNYVVQHVLEHGAAEDRSRLVAGVRGKVLQLSQHKFASNVVEKCVTHATRNERALLIDELCGFNDNALHVMMKDQYANYVVQKMIDVAEPTQRKVLMHKIRPHIGSLRKYTYGKHIIAKLEKFFMKAPELGPIGPPPPNPVL-