Monarch geneset OGS2.0

DPOGS210274
Transcript: DPOGS210274-TA, 930 bp
Protein: DPOGS210274-PA, 309 aa
Genomic position: DPSCF300216 + 97595-106493
RNAseq coverage: 8x (Rank: top 86%)
Annotation
Heliconius: HMEL008621, E-value 2e-29, identity 52.88%
Bombyx: BGIBMGA000027-TA, E-value 2e-27, identity 50.96%
Drosophila: CG3640-PA, E-value 5e-06, identity 32.63%
EBI UniRef50: UniRef50_P35778, E-value 4e-09, identity 40.96%, Venom allergen 3 n=11 Tax=Vespoidea RepID=VA3_SOLIN
NCBI RefSeq: XP_001865175.1, E-value 7e-10, identity 37.36%, venom allergen 5 [Culex quinquefasciatus]
NCBI nr blastp: gi|447712, E-value 6e-09, identity 40.96%, allergen sol i III
NCBI nr blastx: gi|198443145, E-value 8e-09, identity 40.96%, Chain B, Crystal Structure Of The Major Allergen From Fire Ant Venom, Sol I 3
Group
KEGG pathway 
InterPro domain: [4-146] IPR014044, E-value 1e-16, CAP domain
InterPro domain: [12-83] IPR001283, E-value 6.2e-13, Allergen V5/Tpx-1-related
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210274-TA
ATGGGCCCAACATGCGCGGGTGTAGAAAATGCGACTATGACGGAGGACAATGCGGCTCTGATACTAGATTTGATCAACAGTATAAGAAGCAGAGCAGCTCGTGGTCTGGCCATGGGCTATGAGAAAGAATTACTCCCAAGAGCCTATGGAATGTACAGAGTCGAATGGGATCCAGAATTGGCCACATTGGCCCAAGTGTGGGCTAATCAATGTGTTTTAGAACGAGACAATTGTCGAGCAACTAAAAATTTTCCCGATCCCGGACAACAGGCTTCGATAGCTCGCTTTGTAACGGACAAATGGATACCTATAAGTAAAACGAAAGACAAAACTTATAATGAATCATCTGGTTTTAATTCACACAAGGTAATCCGTCTGGTTTGCAATTTTTCTTCAAGAGTTTACGACGATCGTGGGATATATAACGTCACTGCTCCAACCACATCAGAGTTCACTCCCCAATGTGGCTGTCCGCCAGGATACGACGAGGATTCCTGGTGTTTGTGCTACAAAAGCGAAAAGAATAAAAAAACAAACTGTAAGATTAACAACACGAAGTCTCTGTCATTGAAGAAAATTCGAAATCAGCACACGAAAAGTAGGCAGACCCAATTAAATTATAATAATAATCTATACGAAAAAGAAAGCACCGAGAAAGACAAAGCCGAATTTCCAATTTTCTTAAGAAGAAGAAATAAGAAGAAGCATCATGGCAACAACAATAACGACCTACAACCAAAAAGTTCTAGAAAGAACGAGAAAAGAAAAAAATTAAATAAAAAACGTAAACCAAAAGCCAAATACAACAAGAAATTCATAAAAACAACTTTAAAAGACATCGACAGGGATTCCAATATGTCCGGTTCGAAAGACAGTGATAGCAAAAACAAACCCGTCGACATTATTGTTCATATTAAAATGAACGAATAA

Protein sequence:

>DPOGS210274-PA
MGPTCAGVENATMTEDNAALILDLINSIRSRAARGLAMGYEKELLPRAYGMYRVEWDPELATLAQVWANQCVLERDNCRATKNFPDPGQQASIARFVTDKWIPISKTKDKTYNESSGFNSHKVIRLVCNFSSRVYDDRGIYNVTAPTTSEFTPQCGCPPGYDEDSWCLCYKSEKNKKTNCKINNTKSLSLKKIRNQHTKSRQTQLNYNNNLYEKESTEKDKAEFPIFLRRRNKKKHHGNNNNDLQPKSSRKNEKRKKLNKKRKPKAKYNKKFIKTTLKDIDRDSNMSGSKDSDSKNKPVDIIVHIKMNE-
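
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, assuming the standard nuclear genetic code and the record's convention of writing the stop codon as `-`. The function name `translate` and the constant names are illustrative and not part of the database.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code,
# writing stop codons as "-" to match the protein record above.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '-'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append("-" if aa == "*" else aa)
    return "".join(protein)


# First five codons and last four codons of DPOGS210274-TA.
print(translate("ATGGGCCCAACATGC"))  # start of the transcript
print(translate("ATGAACGAATAA"))     # end of the transcript, including the stop

# Length check: 309 residues plus one stop codon should account for 930 bp.
print(3 * (309 + 1) == 930)
```

Passing the full DPOGS210274-TA sequence to `translate` and comparing the result with the DPOGS210274-PA entry would confirm that the two records agree.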