Monarch geneset OGS2.0

DPOGS211384
TranscriptDPOGS211384-TA3312 bp
ProteinDPOGS211384-PA1103 aa
Genomic positionDPSCF300115 - 629344-643477
RNAseq coverage275x (Rank: top 39%)
Annotation
HeliconiusHMEL0021002e-15362.91% 
BombyxBGIBMGA010951-TA3e-16380.00% 
Drosophilasec8-PA4e-4330.37% 
EBI UniRef50UniRef50_UPI00015B40D64e-4532.51%UPI00015B40D6 related cluster n=1 Tax=unknown RepID=UPI00015B40D6
NCBI RefSeqXP_001608195.16e-4632.51%PREDICTED: similar to ENSANGP00000008527 [Nasonia vitripennis]
NCBI nr blastpgi|1565379941e-4432.51%PREDICTED: exocyst complex component 4-like [Nasonia vitripennis]
NCBI nr blastxgi|1565379941e-4532.51%PREDICTED: exocyst complex component 4-like [Nasonia vitripennis]
Group
Gene OntologyGO:00001452.8e-14exocyst
GO:00069042.8e-14vesicle docking involved in exocytosis
GO:00150312.8e-14protein transport
KEGG pathwaynvi:1001242722e-45 
 K06111 (EXOC4, SEC8L1)maps-> Tight junction
InterPro domain[34-133] IPR0071912.8e-14Sec8 exocyst complex component specific domain
Orthology groupMCL13995 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211384-TA
ATGTCTTCACCGCCCCCCACTAAACCACCGCGAGGAGTGAAACAGGGAAAAGAAACTAGTGGCTTGTTAATGTCAGTGATCCGAACACTATCAGCGAGTGAAACAAACGAACAGAGGGAAAAAGAAAGAGCAAAACTGGAAAAGGAGTACAAAAAGAGTGATGCACGCCTTGACGAATTGGTGGCAGCTCATGCTCAAGAAATTACAGATGTGATGCAGAAGTTCAGTTCAGTGAGCACGGCCCTCAGTGCTTGGGGACAACATTGTTCAGCGGCGGAGGCTCGGCTGGCGGCTTGTCGCGCTCTACTGAGAGTCCGAAGAGATGACCTCCGAAGACTGTGGGCCGATGCCAGGACACATCACCATGCCTTACGGATGCTCACCGATATAGAGCGGGTAGTGGTGTGTGAGCGGGAGGCTCGCGAGCTGACCGCGGCGGGGCGGGTGGTGGGCGCGGCGCACGCCCTCAGGGCGGCCTTGATGACGGCCCGGACCGAGCTTCCGGCCGTCACCGCGCTCAACGCCACCACCGACCATTTAGAGGCCAAGAAGCAGGAGTTGGTTGACGTCATAGTCGGCCGTTTGAGTGACGCGGTGTACCGTGACGAGCGAGCGCCGCTCAACAGACGACTGTCCGCCAGGAGGAGGACCGCGCTACTGCTGGCCGAACTGACGAAGAGCGAGGAGGTGACGGATTCGTATTTACAAGACATAACGCCAGAGTACGAGGCGTCCCTCACCGAAGACGACAAGACCTTTGAGACCACAGTCATGATATCACTAGAGGCGCTCGGAGTGCTGGAACAACTGAGAGAGGCCACGGAGAAGTTCAAAGTTCAAATACAGAACGAGCTGCTGGAAGTCATCGACTCCGTGTCTCGGCGGATCATAGAGGAAGGAGACGTAGAGGAGGGAGAGGAGGACGGGGAGATACAGGAGGCGGGGGGAGAGAGCGAGACGGGGAGACCGCTCACCAGGCTCATACACTGTCTGACTGAAGAGTTCGAGGCTGGCGGGACGAGGAACAGGCGTTTACTACAGCTGTGGCGCGCCGCGTTAATAAAACACAAGATCTCCGAGCAGTCGCTGCACACCGAGCAGCACTACTGGAGCGCCGTGCAGCAAGTGGTGGGAGCGATGTACTATTACACCAATATAACGAGAACCTGGTGTTGGTTCAATGTAATCTACGTTCTCGTTATATTGGACGCGCGCCTCGCACACGCCTCCCTACAGCCGCCGCCGCCCGCCCGCGCCGCTCGCTGGCTGGCCGACGACGACATCGTGCGCTTCCTCACCGCACTGCCTAACTACGTGGCCGCAGTGCAACGACAGGACTACACGGACAAGAAGTTCAGTTCAGTGAGCACGGCCCTCAGTGCTTGGGGACAACATTGTTCAGCGGCGGAGGCTCGGCTGGCGGCTTGTCGCGCTCTACTGAGAGTCCGAAGAGATGACCTCCGAAGACTGTGGGCCGATGCCAGGACACATCACCATGCCTTACGGATGCTCACCGATATAGAGCGGGTAGTGGTGTGTGAGCGGGAGGCTCGCGAGCTGACCGCGGCGGGGCGGGTGGCCAAGAAGCAGGAGTTGGTTGACGTCATAGTCGGCCGTTTGAGTGACGCGGTGTACCGTGACGAGCGAGCGCCGCTCAACAGACGACTGTCCGCCAGGAGGAGGACCGCGCTACTGCTGGCCGAACTGACGAAGAGCGAGGAGGTGACGGATTCGTATTTACAAGACATAACGCCAGAGTACGAGGCGTCCCTCACCGAAGACGACAAGACCTTTGAGACCACAGTCATGATATCACTAGAGGCGCTCGGAGTGCTGGAACAACTGAGAGAGGCCACGGAGAAGTTCAAAGTTCAAATACAGAACGAGCTGCTGGAAGTCATCGACTCCGTGTCTCGGCGGATCATAGAGGAAGGAGACGTAGAGGAGGGAGAGGAGGACGGGGAGATACAGGAGGCGGGGGGAGAGAGCGAGACGGGGAGACCGCTCACCAGGCTCATACACTGTCTGACTGAAGAGTTCGAGGCTGGCGGGACTAGGAACAGACGTTTACTGCAGCTGTGGCGCGCCGCGTTAATAAAACACAAGATCTCCGAGCAGTCGCTGCACACCGAGCAGCACTACTGGAGCGCCGTGCAGCAAGTGATGCAGCTTCTACTAACGGAATATCTGGAGATAGAGAGTGTCGCGCTGTCCTCTCAGCGAGCGCCTGAACCTGTCACTGAGTTGAAACTGAGCGATTACTTCAGTAAGAAGAAACCTCAGAGGAGACGAGAGAAGCTGTTCAAACTGATGACGCCCCCGCCGCCGTCAGCCCGCGGCCGCCGTCACCGCTACCCGCTAGTGTGCACTCCGCAGCCGGCGCTGTTGCACGCCGCGTTACCGGCCTTACACGCTCTGTGCGCGCTGATAGAGAAACCCGGTGCCGGTGAGTGTTCACTGCGGATGTTCGTGTCGGACTACGTGAAGTGGTGTGAGACCGAGCGTCTCTCCCGCGAGGCCCGCGCCGCGGTCGAGGCTGCTGTCAAGGCCCCCGGGGCCTGGAGAGAGAGGGTCATGGTCCCTGGACGGTCCGCCTGGTGTCCGATTAACTTTACAGTACAAGTAGTAATTGTGAGGAACCCGTTATTCCCTGTCGCCTCACTTATAAGTCCCCCCGTGTCCAGTGAGAGCGAGCGGCCGGTTTTGTCGTGTTGTGTGGAGGCGTGGTCGGTGGTTCGCATCGAGTGCTTCCACTACCTCGGCCAGGAGGAGCGCGAGGAGGCCGTGGGGTCCGGCGGGGGGAGGCCCGGGGGCGCAGGCCCCGGTGACGGCGAGGAGGCCGGAGGCGCCGCCCGCCTCGCGCACAGACTGCTCGGCTTCCACGAACACGCGGCCGGCATACTGACTCCCGCCGCCATGACGTACATAATGAACGGTATCGGCGAGATGATGTCCGCGGCGGTGGTGTGGCGCTGGCAGAGCTCGGCGGCCGTGGACGGTGCGGGGGGCGCGAGGGGTGCGGAGGGTGCGAGGGGTGCGGTGTCCCGCCTCGCCGCCCTCCGCCACTGCCTTGCGGCGCTGTCAGTTCCTCACGACGGTCTCCACTCCGCACACGCTTACATGCACCTGCTGGCGTGCACCCCCGAGGAGATCATAACGTCGGTCCGTGAGAAGGGTCCGCAGTTCTCGGAGTTGGAATACTTGAACGCTTTCAAGGTGATAGGAGCTCGTCGCGGCCTGGCCCCCGCCGACATGAAGGAGCAGCTCAAACAACTGTCCGCCGCCCTCGGACACGTCGGCGTCACAGTGTGA

Protein sequence:

>DPOGS211384-PA
MSSPPPTKPPRGVKQGKETSGLLMSVIRTLSASETNEQREKERAKLEKEYKKSDARLDELVAAHAQEITDVMQKFSSVSTALSAWGQHCSAAEARLAACRALLRVRRDDLRRLWADARTHHHALRMLTDIERVVVCEREARELTAAGRVVGAAHALRAALMTARTELPAVTALNATTDHLEAKKQELVDVIVGRLSDAVYRDERAPLNRRLSARRRTALLLAELTKSEEVTDSYLQDITPEYEASLTEDDKTFETTVMISLEALGVLEQLREATEKFKVQIQNELLEVIDSVSRRIIEEGDVEEGEEDGEIQEAGGESETGRPLTRLIHCLTEEFEAGGTRNRRLLQLWRAALIKHKISEQSLHTEQHYWSAVQQVVGAMYYYTNITRTWCWFNVIYVLVILDARLAHASLQPPPPARAARWLADDDIVRFLTALPNYVAAVQRQDYTDKKFSSVSTALSAWGQHCSAAEARLAACRALLRVRRDDLRRLWADARTHHHALRMLTDIERVVVCEREARELTAAGRVAKKQELVDVIVGRLSDAVYRDERAPLNRRLSARRRTALLLAELTKSEEVTDSYLQDITPEYEASLTEDDKTFETTVMISLEALGVLEQLREATEKFKVQIQNELLEVIDSVSRRIIEEGDVEEGEEDGEIQEAGGESETGRPLTRLIHCLTEEFEAGGTRNRRLLQLWRAALIKHKISEQSLHTEQHYWSAVQQVMQLLLTEYLEIESVALSSQRAPEPVTELKLSDYFSKKKPQRRREKLFKLMTPPPPSARGRRHRYPLVCTPQPALLHAALPALHALCALIEKPGAGECSLRMFVSDYVKWCETERLSREARAAVEAAVKAPGAWRERVMVPGRSAWCPINFTVQVVIVRNPLFPVASLISPPVSSESERPVLSCCVEAWSVVRIECFHYLGQEEREEAVGSGGGRPGGAGPGDGEEAGGAARLAHRLLGFHEHAAGILTPAAMTYIMNGIGEMMSAAVVWRWQSSAAVDGAGGARGAEGARGAVSRLAALRHCLAALSVPHDGLHSAHAYMHLLACTPEEIITSVREKGPQFSELEYLNAFKVIGARRGLAPADMKEQLKQLSAALGHVGVTV-