Monarch geneset OGS2.0

DPOGS212120
Transcript: DPOGS212120-TA, 1053 bp
Protein: DPOGS212120-PA, 350 aa
Genomic position: DPSCF300038 - 167692-189212
RNAseq coverage: 1016x (Rank: top 12%)
Annotation
Heliconius       HMEL007730        2e-59   82.71%
Bombyx           BGIBMGA006590-TA  4e-137  91.19%
Drosophila       Syx1A-PB          3e-118  71.92%
EBI UniRef50     UniRef50_Q24547   6e-116  71.92%  Syntaxin-1A n=26 Tax=Bilateria RepID=STX1A_DROME
NCBI RefSeq      XP_002423912.1    8e-131  90.66%  syntaxin-1A, putative [Pediculus humanus corporis]
NCBI nr blastp   gi|270014238      5e-132  83.10%  Syntaxin 1A [Tribolium castaneum]
NCBI nr blastx   gi|270014238      5e-138  80.44%  Syntaxin 1A [Tribolium castaneum]
Group
Gene Ontology    GO:0016020  2.4e-68  membrane
                 GO:0016192  2.4e-68  vesicle-mediated transport
                 GO:0005515  8.2e-25  protein binding
KEGG pathway     phu:Phum_PHUM095410  2e-130
                 K04560 (STX1A) maps-> SNARE interactions in vesicular transport
InterPro domain  [50-270]   IPR010989  2.4e-68  t-SNARE
                 [52-154]   IPR006011  8.9e-42  Syntaxin, N-terminal
                 [221-282]  IPR000727  8.2e-25  Target SNARE coiled-coil domain
Orthology group  MCL11583  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212120-TA
ATGGCTAGCTCGCCTCGGCTAAGTGGATTCAATTCTACTTTACGTGAGCTTTCTGATGTCAACTGTCTACGTATGAGCATGGAAATGTTGTTGGCGCAAAGTGACGATGATGACGTCGGCCCCGATGACGTTAATGTCCCCGTCGAAGGAGGTTTCATGGATGAATTCTTCAGTGAGGTGGAAGAGATACGAGAAATGATAGACAAGATACAGGCGAACGTCGAAGAGGTGAAGAAGAAGCATAGTGCCATTTTATCAGCGCCGCAATCAGATGAAAAGACAAAGCACGAACTTGAAGATCTCATGGCCGATATTAAGAAGACTGCCAATAAAGTTAGAGGCAAATTAAAACATATAGAACAAAACATCGAACAAGAGGAGCATTCCAACAAATCGTCCGCCGATCTCAGGATAAGAAAGACCCAGCACTCGACACTGTCTCGCAAGTTCGTGGAGGTGATGACGGAGTACAACCGTACCCAGACCGACTACAGGGACAGGTGCAAAAACAGAATACTCAGACAGCTGGAGATCACGGGACGGGCCACGACCGACGACGAGCTGGAGGCTATGCTGGAGCAGGACAATCCAGCGGTATTCACCCAAGGGATTATAATGGAGACTCAGCAAGCCAAGCAGACCCTGGCCGATATAGAAGCCCGACACGCTGATATCATCAAGCTGGAGACCTCCATCCGCGAGCTTCACGACATGTTCATGGACATGGCAATGCTTGTTGAGAGTCAGGGTGAGATGATCGACCGCATTGAGTATCATGTAGAGCACGCTGTGGACTATGTCCAAACTGCCACCCAAGATACAAAGAAGGCCCTTAAATATCAGAGCAAAGCCCGACGGAGCGAGCAGGGTGAACTGATAGATCGCATCGAGCACCACGTGACCGCTGCCACTGACTACGTGGAGTCCGGTGGGGGGGAGCTTGTACAGGCTCACAAGTGGGCGGTTAAAGCGAGGAAGAAGAAGATCATGATCCTTGTATGCCTGCTCATACTGGGCCTGGTGGTGGTCGGATACGTCTGGTCCTCATTGTGA

Protein sequence:

>DPOGS212120-PA
MASSPRLSGFNSTLRELSDVNCLRMSMEMLLAQSDDDDVGPDDVNVPVEGGFMDEFFSEVEEIREMIDKIQANVEEVKKKHSAILSAPQSDEKTKHELEDLMADIKKTANKVRGKLKHIEQNIEQEEHSNKSSADLRIRKTQHSTLSRKFVEVMTEYNRTQTDYRDRCKNRILRQLEITGRATTDDELEAMLEQDNPAVFTQGIIMETQQAKQTLADIEARHADIIKLETSIRELHDMFMDMAMLVESQGEMIDRIEYHVEHAVDYVQTATQDTKKALKYQSKARRSEQGELIDRIEHHVTAATDYVESGGGELVQAHKWAVKARKKKIMILVCLLILGLVVVGYVWSSL-
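
The two sequences above are consistent with each other: 350 codons plus one stop codon give 350 x 3 + 3 = 1053 bp, and the trailing "-" in the protein sequence marks the stop. The Python sketch below illustrates that relationship by translating the first ten and last four codons of DPOGS212120-TA. It assumes the standard genetic code. The `translate` helper and its `stop_symbol` parameter are hypothetical names introduced for this example and are not part of the geneset's own tooling.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence (hypothetical helper).

    Stop codons are written as stop_symbol, matching the "-" that ends
    the protein sequence in this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


# First 30 nt and last 12 nt of DPOGS212120-TA, copied from the record.
first_codons = "ATGGCTAGCTCGCCTCGGCTAAGTGGATTC"
last_codons = "TCCTCATTGTGA"

print(translate(first_codons))  # MASSPRLSGF
print(translate(last_codons))   # SSL-

# Length check: 350 residues plus one stop codon account for 1053 bp.
print(350 * 3 + 3)  # 1053
```

Passing the full nucleotide sequence to the same function is expected to give the full protein sequence, provided the record uses the standard code throughout.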