Monarch geneset OGS2.0

DPOGS212345
TranscriptDPOGS212345-TA1230 bp
ProteinDPOGS212345-PA409 aa
Genomic positionDPSCF300019 - 149111-151844
RNAseq coverage150x (Rank: top 53%)
Annotation
HeliconiusHMEL0082195e-6976.05% 
BombyxBGIBMGA004663-TA1e-14978.14% 
DrosophilaCG7102-PA1e-11956.58% 
EBI UniRef50UniRef50_Q9VLV62e-11756.58%CG7102 n=27 Tax=Neoptera RepID=Q9VLV6_DROME
NCBI RefSeqNP_787994.13e-11856.58%CG7102 [Drosophila melanogaster]
NCBI nr blastpgi|3228022551e-13657.99%hypothetical protein SINV_02639 [Solenopsis invicta]
NCBI nr blastxgi|3228022553e-13357.67%hypothetical protein SINV_02639 [Solenopsis invicta]
Group
KEGG pathwayame:7244851e-33 
 K11097 (SNRPE, SME)maps-> Spliceosome
InterPro domain[225-408] IPR0065712.1e-23TLDc
[72-182] IPR0117057.2e-14BTB/Kelch-associated
[14-70] IPR0109207.5e-13Like-Sm ribonucleoprotein (LSM)-related domain
[23-71] IPR0011637e-10Like-Sm ribonucleoprotein (LSM) domain
[22-99] IPR0066497.4e-06Like-Sm ribonucleoprotein (LSM) domain, eukaryotic/archaea-type
Orthology groupMCL13481 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212345-TA
ATGGCATACAAAGGCCCACCTAAGGTACAGAAGGTTATGGTACAACCCATTAACCTTATCTTCAGATATTTACAAAACAGAAGCCGTGTACAAATATGGCTTTACGAGAATGTGAATTTGAGAATCGAGGGTCACATTGTTGGTTTCGATGAATACATGAATATTGTTTTGGACGAAGCTGAAGAAGTTCACATGAAAACCAAAAATCGCAAGCAAATTGGAAGTGGTAAGAGCGCATCGTCTTTCTTAGAGCGCTGCATATCATTCATTGGAGACAATGCTGCGGATTGTGTCAAGACTAATGCGTTCCTCAACCTCCCAAAAGAAGCACTCATCAAACTCATATCTTCAGACTTTCTGTGTCTGGAGGAGGAGGAGGTGTGGCGTTGTGCTCTGGCGTGGTCCAAGCAGCGGGCCGGCGTGACGCAGCCCGCCGTGCATTGGACGGGCGAGGAGCGAGCCCGGGTCTGCCAACACCTGGCCCCGCTCATGCAGCACGTGCGACTGCTACTCATCGACAGTACGGTGTTCGCGGAGGAGGTGGAACCCACGGGAGCCGTGCCCATGGAACTGTCCTTGGAGCGCTACCGCCGCGCCGCACTGCACGCCGCACCGCGACACGAGCCCGACAAGAGGACGCAGCCTCGGTCGGCCGTGAACATGTTCGTGGGGTCGGTGATCCTGCAGCAGGACCGCGGGGGCCTGCAGTCCCTGGTGAACAGCTGGTGTGGGGCGCCGGGGGGCCGGCGGGCCTGGCGGCTCGTGTTTCGCGCCTCCAGCCACGGCTACTCTGCCGCCGCCTTCCACACGCACTGTGACGGAGTGGCGCCCGTGTTACTCTTAGTACAGCTGTCCCGGGGCGAGGTCATAGGCGGCTACAGTACGGCGGGCTGGTCCCCGGGCGGGGCGGGCGGCTACGTGTCTTCGGAGCGCGGCCTGCTGTTCTCCCTGAGCGAGCCGCCGGTCCGCTACCCGCTCCTCAAGAAACCCTTCGCCCTCTGCTACCATCCAGACTGTGGGCCCATATTCGGCGCGGGTGCGGACCTGCTGATCTCCAACAACTGTAACATGAACAGTGACAGCTACAGTAACCTCCACTCGTACGGGGACGGCTCGCTAGGGTCACTGGGATCCCTGGGGTCTCCGGGGCCCCAGCCCGCGCCCTCCTCGCTCGCATCTGAGTACAACTTCACCGTCCGTGACTACGAGATCTTCACGCTCGACCACTAA

Protein sequence:

>DPOGS212345-PA
MAYKGPPKVQKVMVQPINLIFRYLQNRSRVQIWLYENVNLRIEGHIVGFDEYMNIVLDEAEEVHMKTKNRKQIGSGKSASSFLERCISFIGDNAADCVKTNAFLNLPKEALIKLISSDFLCLEEEEVWRCALAWSKQRAGVTQPAVHWTGEERARVCQHLAPLMQHVRLLLIDSTVFAEEVEPTGAVPMELSLERYRRAALHAAPRHEPDKRTQPRSAVNMFVGSVILQQDRGGLQSLVNSWCGAPGGRRAWRLVFRASSHGYSAAAFHTHCDGVAPVLLLVQLSRGEVIGGYSTAGWSPGGAGGYVSSERGLLFSLSEPPVRYPLLKKPFALCYHPDCGPIFGAGADLLISNNCNMNSDSYSNLHSYGDGSLGSLGSLGSPGPQPAPSSLASEYNFTVRDYEIFTLDH-