Monarch geneset OGS2.0

DPOGS212285
Transcript: DPOGS212285-TA (2049 bp)
Protein: DPOGS212285-PA (682 aa)
Genomic position: DPSCF300077 + 434264-443508
RNAseq coverage: 874x (Rank: top 15%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL014925        4e-98    70.31%
Bombyx          BGIBMGA011445-TA  4e-64    56.63%
Drosophila      CG31211-PB        1e-10    34.33%
EBI UniRef50    UniRef50_E2BUF1   4e-50    34.44%    Splicing factor, arginine/serine-rich 18 n=2 Tax=Formicidae RepID=E2BUF1_HARSA
NCBI RefSeq     XP_001121946.1    4e-53    34.43%    PREDICTED: similar to CG31211-PA, isoform A [Apis mellifera]
NCBI nr blastp  gi|328780871      7e-51    31.19%    PREDICTED: splicing factor, arginine/serine-rich 18-like [Apis mellifera]
NCBI nr blastx  gi|328780871      2e-84    32.79%    PREDICTED: splicing factor, arginine/serine-rich 18-like [Apis mellifera]
Group
KEGG pathway:
Orthology group: MCL16559 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212285-TA
ATGTTCTCTAGCAAAGATGCCGTAAATCCAGGATATCCCACGCAGTGGGCTTTAAATCCAACTGCATATCAAAATATAGATTCAAGCCAAGTAGATTGGGCAGCTTTAGCACAACAATGGATTGCTATGAAAGAAGCCGCGGTGATTGTGTCAACGCCACAGTCAAAAGCCGATGTGGAAGAGGGCGAAGCACCTATGGAAGTAGAAAATCCAGAGGCCAGCGAACCACCTATCGGTGCAGGCCCCGAGTGGAATGGCTCAACGAACTCATGGGGAGGTTCTTGGAACCAGTGGGGATGGGGTTGGAGTGGCACAGGACCAATGGATCCTAAAATGGCTAGTGATCCATCAATAGCCATGGGTCCAATGATGGATAGCTATCCCGTAGCAGATAACAATACAACCATGCCAGGTTACACGAGCGGTGCTGTGCCGACTCCAACATTTCAACATGGTTATTGGACGGCTCAAAACTCTGACCAGTCGGGCAATAACCGGAATCGCGATAGGAGATCTAAGAGCAAAACTAGAGAAATTAAACCTACCAGAAGTAGATCACATAGAGATAAGTTACCTTTAATACCACCTGTGATGGAACCGCTGGTTATGCCGACTCCGACATCTACTATTGACGCAGCTAAGAGACGACAGCTACCGGCTTGGATAAGAGAAGGCTTAGAGAAAATGGAACGAGAGAAACAGAAAGCAATAGAAAGGGAACAGGAGAAGAAAGCGAGAGAGGAAGCGGAGAAGGAGAAGAAGAGGATTGAAGAGGAAGAGTTGCAGAGGCTGAGGGACGAGGGACATACTGTGCTGCCGGCCAAGAGCAAATTCGATTCAGACTCCGAGGGCGAAGCTCCCCCTCCCCCTCCCGCTGTCATTCCTCCGCCCCTGGGACGAAAATCAAAAGAGGAGGCTCTTCAGGATGTGATGTTAGCGGTGCGTCGTTCGCTAACAGAGATCCTGTTGGAGGTCACTGACAGCGAGATACAAACGGTGAGCCAGGAAGAAGTGGCTCGGTATAACGCCGCACAAGCGTCGCGACTCAACGCTATGAAGGCGAGCAAGTCCAAGGCGCTCGCGTCCATCGCCAGCGGTCTCGGTCTTGGAGCGTACGAGAGCAGCGAGGACAGCGGCGACGAAGACCAACACGATATGTCCGACCAGCAGTTACAGGAGGTCATAAGACGGAAGCGTCAAGAATTCGAACGTACCTCACGAGAGATAGAGGCGGAGGTGAGACGAGCTGAACAACGAGAGAATGAAGAAGAGGGCTCTCAGCATCACGACACGCCGGAGAGACCGCGCAGATCACGTTCTTCTGCTACGCCGCCGCCGCTGGACAGTGAGACGCCAGAGAAGAAACCGGAACGTCGCCCGTCTAAGGATAAAAGAAGCAACCACAAGTCATCTGAGAAGACAGATAGAAAATTGGACGTCATTCAAGAAGAGAAGACACCAAAGAAAACGAACAAATACGAAACTACACCAACCATGACGAAAGCTATCAAGTCGAGCTCAAATTCAAGTTCCAGTGACTCAGACGACGACTCGTCTAGTACCAGTAAATCATCGTCAGAGAGTGAACCGGAAGTGAAAGTAGAAAACAATAGGACTAAGAAACGTAAGCGTAGAAGTACCAGCTCCAGCGACACTAACAAGAAATCAAAGAAACATAAGAAAGACAAATCACACAAATCGAGCGAGAAGAGTTACTCCAAGAAACATCAGGAGGAATACGATAGGAATGATAAATCTAGATCCAAGAGAAAAGACGAATATTATGAAAAGCACAAACATAGAAGCCGGGACGAAAGGTCGCACAAAGACAAGTACAGGGAGGAGTCGGACGAGGACAGGGCGAGGAAGCGGTCCAAGCGTTCAGTTAGCTACGAATCAAGATCGGGACGACGAAAAAGTAGAGATCGATCCGAAGATAGATCTCGAAGGCGGGACAAACGATCCTACGATAGGGATAGGTCTTACGATAGGTCCAGAGA
CTACGATAGGTATGATCGCCACGACAACTATTCCCGCCATCGCAGATGA

Protein sequence:

>DPOGS212285-PA
MFSSKDAVNPGYPTQWALNPTAYQNIDSSQVDWAALAQQWIAMKEAAVIVSTPQSKADVEEGEAPMEVENPEASEPPIGAGPEWNGSTNSWGGSWNQWGWGWSGTGPMDPKMASDPSIAMGPMMDSYPVADNNTTMPGYTSGAVPTPTFQHGYWTAQNSDQSGNNRNRDRRSKSKTREIKPTRSRSHRDKLPLIPPVMEPLVMPTPTSTIDAAKRRQLPAWIREGLEKMEREKQKAIEREQEKKAREEAEKEKKRIEEEELQRLRDEGHTVLPAKSKFDSDSEGEAPPPPPAVIPPPLGRKSKEEALQDVMLAVRRSLTEILLEVTDSEIQTVSQEEVARYNAAQASRLNAMKASKSKALASIASGLGLGAYESSEDSGDEDQHDMSDQQLQEVIRRKRQEFERTSREIEAEVRRAEQRENEEEGSQHHDTPERPRRSRSSATPPPLDSETPEKKPERRPSKDKRSNHKSSEKTDRKLDVIQEEKTPKKTNKYETTPTMTKAIKSSSNSSSSDSDDDSSSTSKSSSESEPEVKVENNRTKKRKRRSTSSSDTNKKSKKHKKDKSHKSSEKSYSKKHQEEYDRNDKSRSKRKDEYYEKHKHRSRDERSHKDKYREESDEDRARKRSKRSVSYESRSGRRKSRDRSEDRSRRRDKRSYDRDRSYDRSRDYDRYDRHDNYSRHRR-
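
The transcript and protein entries above can be cross-checked against each other: a 2049 bp coding sequence holds 683 codons, that is 682 residues plus one stop codon, and the protein record marks the stop with a trailing "-". The following is a minimal sketch of such a check, assuming the standard genetic code applies and that the transcript is a complete coding sequence with no UTRs. The names `translate` and `expected_protein_length` are hypothetical helpers written for this illustration, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol.

    Whitespace (including FASTA line breaks) is ignored. The default
    stop symbol follows the "-" convention used in the protein record.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Residue count for a CDS that ends in exactly one stop codon."""
    return transcript_bp // 3 - 1


# First seven codons and last four codons of DPOGS212285-TA.
print(translate("ATGTTCTCTAGCAAAGATGCC"))  # MFSSKDA
print(translate("CATCGCAGATGA"))           # HRR-
print(expected_protein_length(2049))       # 682
```

Passing the full DPOGS212285-TA sequence to `translate` and comparing the result with the DPOGS212285-PA record would extend the same check to the whole entry.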