Monarch geneset OGS2.0

DPOGS209639
TranscriptDPOGS209639-TA1173 bp
ProteinDPOGS209639-PA390 aa
Genomic positionDPSCF300015 + 1038381-1039553
RNAseq coverage17452x (Rank: top 1%)
Annotation
HeliconiusHMEL0170422e-13186.98% 
BombyxBGIBMGA006704-TA7e-12677.58% 
Drosophilavig-PC8e-4255.03% 
EBI UniRef50UniRef50_E2BN245e-4250.64%Plasminogen activator inhibitor 1 RNA-binding protein n=9 Tax=Formicidae RepID=E2BN24_HARSA
NCBI RefSeqXP_392925.27e-4942.99%PREDICTED: similar to vasa intronic gene CG4170-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3838560425e-5043.55%PREDICTED: uncharacterized protein LOC100878412 [Megachile rotundata]
NCBI nr blastxgi|3838560425e-8247.88%PREDICTED: uncharacterized protein LOC100878412 [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[171-277] IPR0068618.4e-27Hyaluronan/mRNA-binding protein
Orthology groupMCL17369 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209639-TA
ATGGAGAATTCCTACGGTGTGGGGACCGATAACAGATACGCTCTTTTCTTGGACGATGAGTCCGATCCTCTTGATGCGTTAAAAGCTCGAGAGCAAGCAAAAGAGCTCAAGAAAAAGACCAAAGAGGCGGAAAAAGAGAACAAGGGGAAACCTGAGGCCAAACCTAAGGGCGGCAGTGTCGTCACGAGGAAGGGCATCAAGGAAACTCAGAACGTGAAGTCTCAAGAAATTAAGAGTGGTGAACAACAAAAGAACAAAGGTCCACCTCGGACAAACGATCGCAATTCGGAGCGGCCGCCGCCTCGGCGTCGCGAGGATCGTCCTCAGAACGGAGCCGTCGAAGGTAAGGAGGGTGCGCGACCTCCTCGCAGGGAATTCGGTGACCGAAGGCCTAATTTTGAGCGACGTAACTTCAACGATAACTCTGAAGGAGGAGAACGTCGCGGTCCGAGGCCCCCACGTGAGCCGCGTGAAGGACAACGTGGTCCGCGACCGGGCTTCGATGGCCGAGGCAAGCGTGAGTTTGACAGACGGTCAGGCTCTGACAAAACTGGCGTTAAACCGGTAGACAAGCGCGAGGGTGCCGGTCCTCACAATTGGGGAACTATAAAGGACGATATCGATGAATTGAACAAAACCGGATCCGAAGGTGAAGTGGCCGAGGAGAAAGCGCCGGACGCGACCGGCGCTGGCGACGGTCAGCAATCGGAGCCCGAGCGCGCTCCGCCCGCCGAGGAGGAGCCCCGCGAGCTCACCCTGGACGAGTACAAGGCGCTCCGTAACGCTCAACGCATGGCTCCACAGTACAATCTCCGGAAGGCCGGCGAAGGTGAGGATCTTAGCCAGTGGAAGAATCTAGTTCTGCTGGAGAAGAAGAAGGAGGGGGGCGAGGAGGACGACAGTGATGAGGAGTACGACATCGCTGACTACCCCCAGCGAGTCGGACGTCAGAAACGCGTGCTGGGCATTGAGTTCACTTTCAACGACACGGCGCGGCGCGGAGGCACCGGCGGCCGCGGGCGAGGACGCGGACGCGGGCGCGGAGGGCGCGGTGGAGGGACGGGGGCCGGAGTGCCGCGCGAGGAAGCTGCACCCGAGGAGCGGCCGGCGGCCAAGAACCAGGTCGCTCCGCCGAAGGTGGACGACAGCAAGGATTTCCCTTCACTCAGCTAG

Protein sequence:

>DPOGS209639-PA
MENSYGVGTDNRYALFLDDESDPLDALKAREQAKELKKKTKEAEKENKGKPEAKPKGGSVVTRKGIKETQNVKSQEIKSGEQQKNKGPPRTNDRNSERPPPRRREDRPQNGAVEGKEGARPPRREFGDRRPNFERRNFNDNSEGGERRGPRPPREPREGQRGPRPGFDGRGKREFDRRSGSDKTGVKPVDKREGAGPHNWGTIKDDIDELNKTGSEGEVAEEKAPDATGAGDGQQSEPERAPPAEEEPRELTLDEYKALRNAQRMAPQYNLRKAGEGEDLSQWKNLVLLEKKKEGGEEDDSDEEYDIADYPQRVGRQKRVLGIEFTFNDTARRGGTGGRGRGRGRGRGGRGGGTGAGVPREEAAPEERPAAKNQVAPPKVDDSKDFPSLS-