Monarch geneset OGS2.0

DPOGS207570
Transcript        DPOGS207570-TA    1623 bp
Protein           DPOGS207570-PA    540 aa
Genomic position  DPSCF300072 - 367163-392571
RNAseq coverage   627x (Rank: top 20%)
Annotation
  Heliconius      HMEL014567        5e-175  96.88%
  Bombyx          BGIBMGA004733-TA  3e-123  84.39%
  Drosophila      heph-PY           1e-172  62.18%
  EBI UniRef50    UniRef50_Q95UI6   3e-169  61.96%  Hephaestus n=13 Tax=Drosophila RepID=Q95UI6_DROME
  NCBI RefSeq     XP_966484.2       0.0     69.23%  PREDICTED: similar to polypyrimidine tract binding protein [Tribolium castaneum]
  NCBI nr blastp  gi|189241313      0.0     69.23%  PREDICTED: similar to polypyrimidine tract binding protein [Tribolium castaneum]
  NCBI nr blastx  gi|270013160      0.0     72.32%  hypothetical protein TcasGA2_TC011728 [Tribolium castaneum]
Group
Gene Ontology
  GO:0006397  3.6e-179  mRNA processing
  GO:0005634  3.6e-179  nucleus
  GO:0003723  3.6e-179  RNA binding
  GO:0000166  1.5e-25   nucleotide binding
  GO:0003676  1.3e-08   nucleic acid binding
KEGG pathway
InterPro domain
  [63-540]   IPR006536  3.6e-179  HnRNP-L/PTB/hephaestus splicing factor
  [161-262]  IPR012677  1.5e-25   Nucleotide-binding, alpha-beta plait
  [347-416]  IPR000504  1.3e-08   RNA recognition motif domain
Orthology group
  MCL10706  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207570-TA
ATGATGAAAAGTATCGCTGTTGATGGTAACCGTAGCACTGAGAGTCCAGTTACAAAGGCGGCATTTGTCGAGAGGTGGGCAAACATTGATGCCTGCGCCGGTGTATTTTGTACGCCGGTCGGCCATGTTAAAATGCCAAAAGCAGGATTCTGCGGTGGTGAATTGCGCGCACATAGAATACGATGCACGCAAGCATTCCTGGAAATGGCGGAAGAAATATCTGCTGTAGCCATGGTGGCATACTTCGGTGGCTGCGTGGCTCAACTTCGTGGGCGAGCTGTCTACGTGCAGTTCTCTAACCATAAGGAACTAAAAACGGACCAGACTCACAGCAACGCGTCGGCGTCAGCACAAGCGGCACTTCAAGCTGCTCAAGCCATATCTAACAGCGCTGGTGCAGCCCTTGTAGCTGGCGCGGGCGCAGCACTACAACCAGCCAACGGAGGAACCGAAATACAATGTGACCTGCAGGGTGGCCCCAACACTGTCCTGCGCGTTATAATAGAACATATGCTGTATCCGATAGTACTTGATGTACTTTACTCGATCTTCCAGCGATACGGAAAGGTGCTCAAAATTGTCACGTTCACTAAAAACAATTCATTTCAAGCCCTAATCCAGTACCCAGACACCGGGTCAGCGCAAATGGCGAAAACGGCTCTGGATGGTCAGAACATATATAACGGATGCTGCACTCTCCGCATTGACTACTCAAAGATGACCTGCCTCAATGTCAAATATAACAATGATAAGTCCCGAGACTACACTAACCCGACGCTTCCATCTGGAGACGGAGACGCTCATCAGCTGTTGACCAGTGAGTTGATGCCACTGCGGGCCCACCTCGCCCTGAACCTCGCATCCAGGATGAGCGGAGGGGTATTGGCACCACCGTTCCTGGGTCTGGGATCGGGGCTAGCTTCTCCCTATGGTGTGGGAGTGGGGGGAGTAAGCCTCCCAGCGTTTGGGGCTCTCGCTTTGACTCCAGCATCACCAGCACTGCGAGCTTTAGCAGCTCCACCACTACCGCTAGCTACAGTACTGCTGGTTTCAAATCTCAACGAGGAGATGGTCACGCCTGACGCTCTATTTACCCTTTTCGGCGTGTACGGTGATGTGCAACGGGTGAAGATCCTCTACAATAAAAAGGACTCAGCCCTCATCCAAATGGCAGAACCTCATCAGGCTCATCTAGCCATGACTCATATGGATAAGCTTCGGGTGTTCGGTAAAGCAATGCGCGTGATGCTCAGCAAGCATCAGACCGTGCAACTGCCCAAAGAGGGACAGCCAGATGCAGGACTCACCCGGGACTATTCCCAGAGCCCACTCCATAGATTCAAGAAGCCTGGTAGCAAGAACTACCAGAACATTTACCCTCCGAGCGCTACATTACATTTATCCAACATTCCAGCTACCGTTACTGAAGATGACATCAAGGAAGCGTTCACTAAACGAGGTTTCACCATCAAGGCGTTTAAATTCTTCCCGAAAGACCGCAAGATGGCTCTGGTCCAACTGCCATGCATCGACGACGCTGTGGCTGCCCTCATCAAGATGCACAACCACCAGCTCTCGGAGTCCAATCATCTAAGGGTCTCCTTCTCCAAATCAAGTATTTAA

Protein sequence:

>DPOGS207570-PA
MMKSIAVDGNRSTESPVTKAAFVERWANIDACAGVFCTPVGHVKMPKAGFCGGELRAHRIRCTQAFLEMAEEISAVAMVAYFGGCVAQLRGRAVYVQFSNHKELKTDQTHSNASASAQAALQAAQAISNSAGAALVAGAGAALQPANGGTEIQCDLQGGPNTVLRVIIEHMLYPIVLDVLYSIFQRYGKVLKIVTFTKNNSFQALIQYPDTGSAQMAKTALDGQNIYNGCCTLRIDYSKMTCLNVKYNNDKSRDYTNPTLPSGDGDAHQLLTSELMPLRAHLALNLASRMSGGVLAPPFLGLGSGLASPYGVGVGGVSLPAFGALALTPASPALRALAAPPLPLATVLLVSNLNEEMVTPDALFTLFGVYGDVQRVKILYNKKDSALIQMAEPHQAHLAMTHMDKLRVFGKAMRVMLSKHQTVQLPKEGQPDAGLTRDYSQSPLHRFKKPGSKNYQNIYPPSATLHLSNIPATVTEDDIKEAFTKRGFTIKAFKFFPKDRKMALVQLPCIDDAVAALIKMHNHQLSESNHLRVSFSKSSI-
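
The reported lengths can be cross-checked against each other: a 540 aa protein plus one stop codon corresponds to 3 x 541 = 1623 bp, which matches the transcript length given above. The following is a minimal sketch of that check, not part of the original record. The function names are hypothetical, and the genomic span assumes 1-based inclusive coordinates, which the record does not state.

```python
# Sanity checks for the record above. All helper names are illustrative.

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code

TRANSCRIPT_BP = 1623  # transcript length reported in the record
PROTEIN_AA = 540      # protein length reported in the record


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Coding-sequence length in bp for a protein of protein_aa residues."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def genomic_span(start: int, end: int) -> int:
    """Interval length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq starts with ATG, ends with a stop codon, and is a multiple of 3 long."""
    seq = seq.upper()
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    print(expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP)
    print(genomic_span(367163, 392571))
```

The genomic interval is far longer than the 1623 bp transcript, so the difference is presumably intronic sequence. To check the full sequence, pass the DPOGS207570-TA string to `looks_like_complete_cds`.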