Monarch geneset OGS2.0

DPOGS202738
Transcript         DPOGS202738-TA   1326 bp
Protein            DPOGS202738-PA   441 aa
Genomic position   DPSCF300284 + 167108-180494
RNAseq coverage    357x (Rank: top 33%)
Annotation
Heliconius       HMEL012680         1e-119   75.06%
Bombyx           BGIBMGA005357-TA   5e-119   78.37%
Drosophila       shep-PA            2e-81    75.77%
EBI UniRef50     UniRef50_E2A5I6    2e-97    56.25%   RNA-binding motif, single-stranded-interacting protein 1 n=13 Tax=Endopterygota RepID=E2A5I6_CAMFO
NCBI RefSeq      XP_393384.3        6e-91    52.81%   PREDICTED: similar to CG32423-PB, isoform B [Apis mellifera]
NCBI nr blastp   gi|307185103       6e-97    56.25%   RNA-binding motif, single-stranded-interacting protein 1 [Camponotus floridanus]
NCBI nr blastx   gi|345493906       2e-102   50.32%   PREDICTED: protein alan shepard-like isoform 2 [Nasonia vitripennis]
Group
Gene Ontology
GO:0000166   8.4e-20   nucleotide binding
GO:0003676   2.1e-18   nucleic acid binding
GO:0003723   1.1e-10   RNA binding
KEGG pathway 
InterPro domain
[98-184]    IPR012677   8.4e-20   Nucleotide-binding, alpha-beta plait
[112-185]   IPR000504   2.1e-18   RNA recognition motif domain
[111-126]   IPR002343   1.1e-10   Paraneoplastic encephalomyelitis antigen
Orthology group    MCL10624   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202738-TA
ATGGCTAGCGCGGGCGCGCAGTACCGCGGCGGCGCGCAGCAGTGGGCCGCTGCATACGCGCCGCAGCCCTGCCGCTACCCGCCGCCACAGCAGCACTACGCAGCGGCACCCAGCCCTTTCACTCCCCATCAGTACACGAGTGCAACGATGGGGGGAGGATGCGGCGCCACGGGTACGAGGGTACCGACGGCTGCATCGCCGGCTAACACGGCGTCCAGCTCGTCGTCCAACTCCGCGGGGGGTACTCGATCTACCAGCCTAAGTACCGCCCCGCCTCCGCCCGCCTCCTCAGCCTCAACTACAGGCGCGTCCGGCGAACAACTCAGCCGCACCAATCTCTACATACGCGGCCTGAGCCAGACCACAACCGATAAAGACCTCGTCCAAATGTGCCAGATGTATGGCAACATAATATCAACAAAAGCAATATTGGATAAAAATACAAATAAATGTAAAGGTTACGGTTTTGTAGATTTTGAAACAATCGCGTCAGCTGAGGCCGCTGTAAAAGGATTACAAGCCAAAGGTGTTCAGGCCCAAATGGCTAAAGTGGGTATCTGGTTCCTGCGTAGACTGAACCGTCAACAGGAACAGGATCCAACCAACCTGTATATGGCCAACTTGCCACCGCACTTTAAAGAGAACGATGTTGACCAACTGTTGGCCAAGTTTGGTCAAGTCGTGTCCACGAGGATCCTGCGTGATACCCATGGACACAGCAAAGGCGTAGGCTTCGCAAGAATGGAGTCTAGAGAAAAATGCGAGCAGATTATCCAAATGTTCAATGGAAATCCGATACCGGGCGCTAAGGAGCCTTTGCTTGTGAAGTTTGCCGATGGGGGTAATAAGAAAAAGGCTCTGTATAATAAGCAGAATGACAACAACGGTAGAGTGTGGCGAGACAACAATGATTCCATCACTCAGGCGATGAGTGTGACGGGTGTGTACGCTAGCGGTGTCGGCGGCGCCGGCGGGGAGTGCGGCGTGTACCGCAGTAATGTATACGGCGTGGCGTTCCATCCTCAGCTCCACGCTCCAGCCTGGCTGCCGTACGCGGCACTGCTGCCGCCCGCCCATCACGCGCCCCACGCCCCGCACCCCGCACACCCCGCACACCCACAGCATCTGCCCATCGATACCGTGCCTAGTCAATATGTGAACTGGGACAGCTTAAGGCCGGAAAATGAATTATACTACTTCGCGTCGCACCCTTACCAGTATTTCACTGGTCCAACACCGCCCATCATTCAGATGCCGATGGAGAGCGAGCATGCGTCGACGGCCGCCTCCCCGGACGAGGCCTACCAGCCTTACCCCCCCAAGTAG

Protein sequence:

>DPOGS202738-PA
MASAGAQYRGGAQQWAAAYAPQPCRYPPPQQHYAAAPSPFTPHQYTSATMGGGCGATGTRVPTAASPANTASSSSSNSAGGTRSTSLSTAPPPPASSASTTGASGEQLSRTNLYIRGLSQTTTDKDLVQMCQMYGNIISTKAILDKNTNKCKGYGFVDFETIASAEAAVKGLQAKGVQAQMAKVGIWFLRRLNRQQEQDPTNLYMANLPPHFKENDVDQLLAKFGQVVSTRILRDTHGHSKGVGFARMESREKCEQIIQMFNGNPIPGAKEPLLVKFADGGNKKKALYNKQNDNNGRVWRDNNDSITQAMSVTGVYASGVGGAGGECGVYRSNVYGVAFHPQLHAPAWLPYAALLPPAHHAPHAPHPAHPAHPQHLPIDTVPSQYVNWDSLRPENELYYFASHPYQYFTGPTPPIIQMPMESEHASTAASPDEAYQPYPPK-
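
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that assumes the standard genetic code (NCBI translation table 1) and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the last character of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


if __name__ == "__main__":
    # First six codons and last six codons of DPOGS202738-TA.
    print(translate("ATGGCTAGCGCGGGCGCG"))   # MASAGA
    print(translate("CCTTACCCCCCCAAGTAG"))   # PYPPK-
    print(lengths_consistent(1326, 441))     # True
```

Passing the full DPOGS202738-TA sequence to `translate` should, under the same assumptions, reproduce the DPOGS202738-PA sequence including its trailing "-".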