Monarch geneset OGS2.0

DPOGS213649
TranscriptDPOGS213649-TA2994 bp
ProteinDPOGS213649-PA997 aa
Genomic positionDPSCF300165 + 121443-129523
RNAseq coverage1259x (Rank: top 10%)
Annotation
HeliconiusHMEL0045883e-16760.97% 
BombyxBGIBMGA004568-TA0.061.55% 
DrosophilaCG10600-PB3e-15656.40% 
EBI UniRef50UniRef50_E2BN191e-16363.13%Round spermatid basic protein 1-like protein n=3 Tax=Formicidae RepID=E2BN19_HARSA
NCBI RefSeqXP_001608161.13e-16653.26%PREDICTED: similar to ENSANGP00000023698 [Nasonia vitripennis]
NCBI nr blastpgi|3800268985e-16662.73%PREDICTED: uncharacterized protein LOC100872740 [Apis florea]
NCBI nr blastxgi|3800268984e-15945.92%PREDICTED: uncharacterized protein LOC100872740 [Apis florea]
Group
KEGG pathway 
Orthology groupMCL22074 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213649-TA
ATGGCGTCTAATGATTCCCACGTAAAAGTTAATGAAAATTGTGATATCTTGGACGATTGCTACAGTTCATGCAGCGAAACAGATGCCGGTGAAGGTACCAAGCGCCGTAGAGTGACTCATGACTATCGAAAACTTTCGAAATTGGGACATGATCCAAGTGTGTCAACTAATTTAAAACATGCAACTACTGTTTTCAAAGATCGAAAGCGTAAAGGCAGTACAGATTCCAAGGAGTTGGCAGTGTGGGGTGAGAATGCACGACCAGCCAAATGCAGCAAGATATGGCCAGTGTTTGACTCCGGAGGGTTCGGAGATGCTTCAATGTTAACTCTCCACATTGTGCAGAAATCGAGGCCATTAAAGAATCCGCTGTTGTTTAGTAATAACAGTCTGGGTATAGACCTCGATAACATTCGATTAGATTGGTTCGGTGGCATCAATACTTCAAAGAAACAAATGCATCTCTCAAACACCATTCCCAGGGACGATCTCACTAAACGCATATACTCTTCGGTGTCGAAGCAGCTCAGGATAATAAACACGCATCTACAGAGATCCATGTTCAGATACAAGTATCCCAAAGTAACGAGAGCTAATGTTCTGTTGGAGTCCTCTACTTCCAGACGTCTGTCGACACACTTCAGTTTGTCACTGAGAGCTATAAAGTTCCAGTCCACGCCCAGGAGGCAGCTGCCGTGCTACGGGAACCTGGAGCTACCTCTGAAGATGAAGCAAGAGATATCCACACAGTTCTATCCAGTGATCAGTAGGAGCTTACACACGGAGAGTGACCATAGTGATATTAAAGTGAAAGTTGAAAGGGAACTTGTTAAATATGAAGAAGAGAAGGAGAAGAAGAAAGGAGACAGAGAGAGGGAGAGGGAGAGAGACAGAGATAGATCGCATTCATCTCGTCACTCGTGTCAGCAGTGCAAGCGTCGCGCCAAGGTGAAGAGGTGTTCCATAGGAGTCCAGTGCAGGAGGGAGCGCGCGCCGCCGCTGTGGGGTCCGCGGGCACACAGCCTGCAGCTCAAGGAACTCAAGTATCACAGACTGATGCGTGTGGAGACCCATCCTAACGGCGGCGCCAGTGTAGTGTACCTTTACCAGCACGACGTGGACATGTTGAGTGCAATACAGCAGGAGGAGCTGGCCAAGGAATTCTTGAAGCTGGTGTTCTCGGAGGACAGCGAGGGCTGGGCTAACTACGTGTGTGGTGTAGTGCGAGGTGCCGCGGGCAGGCTGCCGTGTCTTCTGCAGTACCTCGCGGAGACCTACCCCACGCTCACCGTGAAGGCCGGAGTCCTCGCCAGAGCCTCGGACATAGAGACCACCACCATCAGCAGATACTACCAACAGGTGCAGCAGTCGTACCTGGCGGGCACATTCAGGTCTGGACCTCTCCATCAGATATCATTAGTGGGGACGGTGCATGAAGAAGTCGGGGGGTACTTCCCGGACATGCTGGACATGTTGACGGATTGCCCCTTCCTTAAACTGACGATGCCGTGGGGCCCGATGTCCGCGGTGGAGATGGAGCCGCAGGAGTCCAACGACGGACCCATACTGTGGATCAGGCCGGGGGAGCAGCTCGTGCCCACGGCGGAGCTAGGAAAGAGTCCCATTAAGAAGAGACGAACCGGCATGAACGAGCTCCGGAACCTGCAGTACCTGCCGCGGTGGAACGAGGCCCGGGAGTTCCTGTTCGAGGACAGGACGCGGGCGCACGCCGACCACGTAGGACACGGACTCGACAGGATCACTACGGCAGCCGTCGGCGTGTTGAAGGCGGTGCAGTGCGGAGACTGTGAGCGCGGCCGCATCACCAAGGACGTGGTGGCCTTCCACGCGGCGGACTTTAACAAGCTGGTGCAGCTGCTGCAGCTGGACCTGTACGAGCCTCCCATCTCACAGTGCGTCACATGGCTAGAAGAGGCAAAGCTCAACCAACTGCGGAGAGAGGGAGTTCGTTATTCGCGCCTCCCTCTGTGCTCTGATGACGTTTACTTCCTGCCTCGGAACATCATCCATCAGTTCAGGACCGTGTCCGCTGTGACGTCCATCGCCTGGCATCTCCGTCTGAAGCAGTACTACCCCGGCTCCGAGGCGTCTTCTCCGGCCGACGAGCGAGCCGCGCCCGCCACGCCCAGGGACGACCTCGCGGAAAATAAATCAACGAAACATAAAAGCGGGGTCATAGAGATGAGGGCGCTGAGTGCCAAGTTCAAAGAGAAGGCGGCTTTGAAGATAACGGAGAGCAGGAAGAGCTTAGACGATCAGAGTAAAGATAGACAGACGAGCAGGGCAGAAGACAAGAAGGAGAGACGGACAGAGAAACATTATAGAACAGAGAAGATAGAGAAAGGCGAGAAACACTCCCACAGGCATAGAGACGCTGAGAAACATAGAGCCGAGAGGTCGGATAGGACGTCCGATAAACAAGACGTGAGAACGTTGGATAAAGAAAATAAAGTTGACAAGGCTGATAGAAGTGACAAAGGAGAGCGGCCGGAGAGAGACAAGACGGATGCGAAGGTCACAGAGAGGAGTGAACGTGCTGAGAAAAGTGACAAGTCGGATAAGTTGGACGGAAAGAGTGAGAGGACAGACAAGGTCGAGACGAGAACTGAGAGAACGGATAAAGAAGAAGCGAAACCTGAAACGTCCGAGAAACTTGAAGTGAAATTCGAAAAATTTGAAGTGAAATCCGAAAAATCCGACAGAACAGAGAAGCCCGAGCAGAGGACGGAAAAGACAGAAACCAGGACAGAGAAAGAGAACAGGACGGACAAAGACGAGCGATCCAACAAGTCGCACGCCCACAGGAAACACTCCTCGAGACACAGATCGTCACATCACAAGGACAGGTCGAGCGACAGGAGAGAGCACAGGAGACACAGGGCGGGCGACAAGAAACAAGAGAGGCACGCGGAGAAGAGTGACAAGCTGGACTGTGACAAGTTACCCAGGGTCGTGCACCGCTCACCCGACTAG

Protein sequence:

>DPOGS213649-PA
MASNDSHVKVNENCDILDDCYSSCSETDAGEGTKRRRVTHDYRKLSKLGHDPSVSTNLKHATTVFKDRKRKGSTDSKELAVWGENARPAKCSKIWPVFDSGGFGDASMLTLHIVQKSRPLKNPLLFSNNSLGIDLDNIRLDWFGGINTSKKQMHLSNTIPRDDLTKRIYSSVSKQLRIINTHLQRSMFRYKYPKVTRANVLLESSTSRRLSTHFSLSLRAIKFQSTPRRQLPCYGNLELPLKMKQEISTQFYPVISRSLHTESDHSDIKVKVERELVKYEEEKEKKKGDRERERERDRDRSHSSRHSCQQCKRRAKVKRCSIGVQCRRERAPPLWGPRAHSLQLKELKYHRLMRVETHPNGGASVVYLYQHDVDMLSAIQQEELAKEFLKLVFSEDSEGWANYVCGVVRGAAGRLPCLLQYLAETYPTLTVKAGVLARASDIETTTISRYYQQVQQSYLAGTFRSGPLHQISLVGTVHEEVGGYFPDMLDMLTDCPFLKLTMPWGPMSAVEMEPQESNDGPILWIRPGEQLVPTAELGKSPIKKRRTGMNELRNLQYLPRWNEAREFLFEDRTRAHADHVGHGLDRITTAAVGVLKAVQCGDCERGRITKDVVAFHAADFNKLVQLLQLDLYEPPISQCVTWLEEAKLNQLRREGVRYSRLPLCSDDVYFLPRNIIHQFRTVSAVTSIAWHLRLKQYYPGSEASSPADERAAPATPRDDLAENKSTKHKSGVIEMRALSAKFKEKAALKITESRKSLDDQSKDRQTSRAEDKKERRTEKHYRTEKIEKGEKHSHRHRDAEKHRAERSDRTSDKQDVRTLDKENKVDKADRSDKGERPERDKTDAKVTERSERAEKSDKSDKLDGKSERTDKVETRTERTDKEEAKPETSEKLEVKFEKFEVKSEKSDRTEKPEQRTEKTETRTEKENRTDKDERSNKSHAHRKHSSRHRSSHHKDRSSDRREHRRHRAGDKKQERHAEKSDKLDCDKLPRVVHRSPD-