Monarch geneset OGS2.0

DPOGS210987
Transcript: DPOGS210987-TA (924 bp)
Protein: DPOGS210987-PA (307 aa)
Genomic position: DPSCF300004 + 180950-184425
RNAseq coverage: 398x (Rank: top 30%)
Annotation
Heliconius        HMEL025014              1e-74    63.06%
Bombyx            BGIBMGA007822-TA        1e-86    66.53%
Drosophila        Hrb87F-PC               1e-72    70.79%
EBI UniRef50      UniRef50_UPI00017930C5  2e-82    55.36%   UPI00017930C5 related cluster n=1 Tax=unknown RepID=UPI00017930C5
NCBI RefSeq       NP_001093319.1          6e-109   68.32%   heterogeneous nuclear ribonucleoprotein A1 [Bombyx mori]
NCBI nr blastp    gi|153792009            1e-107   68.32%   heterogeneous nuclear ribonucleoprotein A1 [Bombyx mori]
NCBI nr blastx    gi|153792009            6e-121   68.71%   heterogeneous nuclear ribonucleoprotein A1 [Bombyx mori]
Group
Gene Ontology     GO:0000166              3.1e-27  nucleotide binding
                  GO:0003676              1.1e-22  nucleic acid binding
KEGG pathway      api:100168877           1e-82
                  K12741 (HNRNPA1_3) maps-> Spliceosome
InterPro domain   [95-203]  IPR012677     3.1e-27  Nucleotide-binding, alpha-beta plait
                  [116-188] IPR000504     1.1e-22  RNA recognition motif domain
Orthology group   MCL10347                Multiple-copy universal gene

Nucleotide sequence:

>DPOGS210987-TA
ATGAAGCCAAGCGGCGGGGACGATGACTATGAGGGTGATGATTTTTTACAAGAGGAACCTGAACATACAAGGAAGTTGTTTATTGGTGGACTAGATTATTGTACGACAGATACGTCACTGAAAGAGTTTTATGAACAATGGGGGGATATTGTTGATGTTGTTGTTATGAAGGATCCACAGACCAAGAGGTCTCGCGGTTTCGGTTTTATAACTTATTCCCGGGCTCATATGGTAGATGATGCTCAGAAAAACAGACCTCACAAGATTGATGGAAGAATTGTAGAACCAAAGCGAGCTGTACCAAGAGAAGAGATCAAAAGGCCGGATGCAAGTGCAACAGTCAAGAAACTGTTTATTGGAGGTATAAAACAAGATATTGAAGAGGAAGACTTGAGAGAATACTTCAGCAAATTTGGCGAAATTATTTCAGTGAGTTTAGTAACAGAAAAAGACACTGGAAAGAAAAGAGGCTTTGGATTTATTGAATTTGATGATTACGATCCAGTTGATAGAATATGCTTACAACCAAGCCACAAGGTTAAAGGTCGTCGCTTGGATGTTAAAAAGGCTATATCAAAGACAGTGATGGCATCTAGTGGTAGTAGGGGACGGAGTGGAAGATGGGGTAATAGACAATTACAGGATTGGAATACAGATGGCGCCTTTGATTCAGGCTACGAGCAGGGTTGGAACAACCAGAATCCTTGGGATAATTCAGGCAATTGGGGCAACCAGGGTTATGATCAAGGTTGGCAGGGCGACTTTGGGAGTGGATATCAACAAAACTATGGTGGAGGGCCAATGAGACCTAATTTTAGTAATACACGACAACAGCCCTATAATGCAGGTGGTGGCTACAACAGCGGCAATCCTTCCAACTTCAACAACTCTGGCAACACTCCCAACAACTCTCGTCGTTTTTAA

Protein sequence:

>DPOGS210987-PA
MKPSGGDDDYEGDDFLQEEPEHTRKLFIGGLDYCTTDTSLKEFYEQWGDIVDVVVMKDPQTKRSRGFGFITYSRAHMVDDAQKNRPHKIDGRIVEPKRAVPREEIKRPDASATVKKLFIGGIKQDIEEEDLREYFSKFGEIISVSLVTEKDTGKKRGFGFIEFDDYDPVDRICLQPSHKVKGRRLDVKKAISKTVMASSGSRGRSGRWGNRQLQDWNTDGAFDSGYEQGWNNQNPWDNSGNWGNQGYDQGWQGDFGSGYQQNYGGGPMRPNFSNTRQQPYNAGGGYNSGNPSNFNNSGNTPNNSRRF-
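
The transcript and protein records above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers, not part of any database tooling, and only the first 30 nucleotides of DPOGS210987-TA are used to keep the example short.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code
# and check that the reported transcript and protein lengths agree.
# Assumption: the standard code (NCBI translation table 1) is appropriate here.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)

# Build the codon -> amino acid lookup in TCAG order.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 nucleotides of DPOGS210987-TA, copied from the record above.
PREFIX = "ATGAAGCCAAGCGGCGGGGACGATGACTAT"

if __name__ == "__main__":
    print(translate(PREFIX))             # first ten residues of the protein
    print(lengths_consistent(924, 307))  # lengths reported in the record
```

Running the same `translate` call on the full nucleotide sequence should reproduce the protein record, with `*` in place of the final `-`.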