Monarch geneset OGS2.0

DPOGS210978
TranscriptDPOGS210978-TA948 bp
ProteinDPOGS210978-PA315 aa
Genomic positionDPSCF300004 - 178076-179798
RNAseq coverage5289x (Rank: top 2%)
Annotation
HeliconiusHMEL0250142e-11495.67% 
BombyxBGIBMGA006405-TA1e-11382.25% 
DrosophilaHrb98DE-PC3e-7669.57% 
EBI UniRef50UniRef50_B4GP017e-7470.56%GL13791 n=5 Tax=Coelomata RepID=B4GP01_DROPE
NCBI RefSeqNP_001093319.15e-8557.67%heterogeneous nuclear ribonucleoprotein A1 [Bombyx mori]
NCBI nr blastpgi|1537920098e-8457.67%heterogeneous nuclear ribonucleoprotein A1 [Bombyx mori]
NCBI nr blastxgi|1565490266e-11359.73%PREDICTED: heterogeneous nuclear ribonucleoprotein A1, A2/B1 homolog [Nasonia vitripennis]
Group
Gene OntologyGO:00001661.9e-27nucleotide binding
GO:00036761.1e-22nucleic acid binding
KEGG pathwaytca:6574657e-80 
 K12741 (HNRNPA1_3)maps-> Spliceosome
InterPro domain[88-198] IPR0126771.9e-27Nucleotide-binding, alpha-beta plait
[109-181] IPR0005041.1e-22RNA recognition motif domain
Orthology groupMCL10347 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210978-TA
ATGAGACCGCAAAACGGAGACGATGGCTTTGATGAGCCCGAGGTACATAGAAAAATTTTCATTGGAGGATTGAACTATAGGACCTCTGATGAATCCTTAAAAGCACATTTTGAAAAATGGGGTGAAATTGTCGATGTTGTTGTTATGAAGGACCCTAAAACGAAAAGATCCAGAGGTTTCGGATTTATCACGTATTCGAAAGCGCACATGGTCGATGATGCGCAACTTAATAGGCCACATCGTATAGATGGCCGTATGGTAGAAACTAAAAGAGCAGTAGCCAGACAAGATATCAAAAACCCGGAAGCTGGAGCTACGGTTAAAAAATTATTTATTGGTGGCATTAAAGACGATCATGATGAAGAACAACTTAGAGAATACTTTTCAAACTATGGAAATGTACAAAATGTCAGTATAGTTACTGATAAAGGAACCGGGAAAAAACGTGGCTTTGGCTTTGTTGAATTTGATGATTATGATCCAGTTGACAAAATTTGTTTGAATGCTCCCCACAAAATTAATGGTAAACAGTTAGATGTTAAAAAAGCACTCCCCAAAGATGGTTCTGACCAAAGAGGAGGGCGAGGTGGTGGTGGTGGAGGAAGAAATGACAACTGGGGCAATGGTGGAGGTTACGGAAACAACCAAGGAGGTTGGAACAACTCCAACCCATGGGACAACAACCAAGGAGGATGGGGTGGTAACCAGGGAGGTGGTGGTTACGGTGGGGGTGGTGGCAATCAAAGAGGAGGGTGGGGGAATCAAGACTTTGGTAATTACAATCAACAGGGATACAGTGGTGGTCCAACGCGTAACCAGCAATATAGCAACAATAGGTCCGCCCCTTATAACGTGAACCAAGGTGGTGGCGGCAGTGGCGGCGGCTACGGTGGCGGCAACTATGGCGGCGGTGGCGGCGGCAACCAAGGAAGAGGTGGAAGATTCTAA

Protein sequence:

>DPOGS210978-PA
MRPQNGDDGFDEPEVHRKIFIGGLNYRTSDESLKAHFEKWGEIVDVVVMKDPKTKRSRGFGFITYSKAHMVDDAQLNRPHRIDGRMVETKRAVARQDIKNPEAGATVKKLFIGGIKDDHDEEQLREYFSNYGNVQNVSIVTDKGTGKKRGFGFVEFDDYDPVDKICLNAPHKINGKQLDVKKALPKDGSDQRGGRGGGGGGRNDNWGNGGGYGNNQGGWNNSNPWDNNQGGWGGNQGGGGYGGGGGNQRGGWGNQDFGNYNQQGYSGGPTRNQQYSNNRSAPYNVNQGGGGSGGGYGGGNYGGGGGGNQGRGGRF-