Monarch geneset OGS2.0

DPOGS204224
Transcript         DPOGS204224-TA   3045 bp
Protein            DPOGS204224-PA   1014 aa
Genomic position   DPSCF300046 - 719208-732328
RNAseq coverage    931x (Rank: top 14%)
Annotation
Heliconius        HMEL015142         0.0      74.77%
Bombyx            BGIBMGA007503-TA   2e-114   58.44%
Drosophila        Esyt2-PC           0.0      51.53%
EBI UniRef50      UniRef50_Q5TVA9    0.0      57.53%   AGAP003725-PA n=3 Tax=Culicidae RepID=Q5TVA9_ANOGA
NCBI RefSeq       XP_001659792.1     0.0      53.52%   synaptotagmin, putative [Aedes aegypti]
NCBI nr blastp    gi|157120914       0.0      53.52%   synaptotagmin, putative [Aedes aegypti]
NCBI nr blastx    gi|195504754       0.0      50.97%   GE23497 [Drosophila yakuba]
Group
Gene Ontology     GO:0005515         4.4e-30           protein binding
KEGG pathway      ptr:457657         5e-15
                  K00923 (E2.7.1.154, PIK3C2) maps-> Phosphatidylinositol signaling system
                      Inositol phosphate metabolism
InterPro domain   [883-1011] IPR008973   4.4e-30   C2 calcium/lipid-binding domain, CaLB
                  [296-381]  IPR000008   1.4e-15   C2 calcium-dependent membrane targeting
Orthology group   MCL11270           Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204224-TA
ATGCCAGTTAACAGCAAGTTCGCTCTTCCTGTGAGCAGTGATGAAAATTTGAGTGTGTTATCAATCATATACAGATTTTTTAAAAAGGTTTCAATAGTGGGAGCGGTGTACCTGGTGGGTTATATGCAATGGAGTGTGGCGTGGCTCCTGGGGCCGGTTGTGTTGTCTGTAATGAGGGACCAGTGGAGACGAGACAGCGAGTACCGTCGCAACCTCGCCAAGACAGCCGCCCTCTCCTCAGAAAAGGACATCGTACTGGCCAAGCTTGATGACCTACCGGCTTGGGTGTTTTTCCCAGACGTCGAAAGAGCGGAATGGTTGAACAGGATATTGCTTCAAGTGTGGCCTAATGTGAACCACTACGCCAGGACTCTTCTGAAGGACACCATTGAGCCTGCGGTGGCGGAGAGCCTCGCCAACTTCAAGCTTAACGGTTTCAAGTTCGAGCGCATGATCCTCGGTACCATTGCGCCGCGTGTTGGAGGCGTCAAGGTCTATGATAAGAACCTCTCGAGGGATGAAATCATCATGGACGTGGACCTATTCTATGCCGGCGACTGTGATATATCATTCGTCCTACAGCGTATACGAGGTGGAATCAAAGATCTACAGATCCATGGCATGGTCCGCGTGGTGATGAAGCCCCTCATCAGTAAGATGCCGCTGGTGGGAGGGTTGCAGGTGTTCTTCCTCAACAACCCCTCCATAGACTTCAACCTGGTGGGCGCGGCCGACGTACTTGACATGCCCGGCTTTAGCGACATCTTACGTCGTTGCATCGTCGAACAAATATCAAGAATGATGGTGTTACCCAACAAGCTGCCCATCAAATTAAGCGATGAGATACCCACGGTCGACTTGAGGATGCCGGAGCCAGAGGGTGTCCTCAGAATTCATTTGGTCCAAGCCCAGAATCTCATGAAGAAGGATGTCTCCATGTTGGGCAAGGGCAAGTCTGATCCGTACGCTATAATAACAGTTGGGGCTCAACAGTGGAAGACAAAGCACATTGACAACAACATCAACCCTAGATGGGAATTCTGGTGCGAGGCGCGAATTATGCAAACACTTGGGCAGGCGTTGGACATTGAAGTGTTTGACAAGGACGAGGGGAACGATGACGACAAACTGGGCAGGTTCTGCTTCACCAATTATATAATTTATGCTATAATAAGCTCGAGAAAGAGCCAGGTGCTGCAGTGCGAGCTGTGGGACTGGGACCCGGGGATGGGCATTCAGAACGATGATTACCTCGGCAGATGTTCCTTAGATATATCTCAAGTTGTCCGTGCTGGACGTTTGGACACGTGGCAAACACTGCAACAGGCTAAGACCGGTAAGGTACATTTGCGTCTATCGTGGCATCGCTTTTCCACTGACTTGTTGGATCTCAGCCATGCTCTAACATCGACTCAACTGGTAAAGAACGCTGAACTGAGTTCGGCAGTTCTATCCGTCTACATCGATTCTTGCAAACATTTGCCTAACGCTCGTGCACAGTCCCGTCCTGATCCATACCTCGTGGTAACGGTTGGCAAGAAGAGTGAGAATACTGGAGTACAAATGAGAACAGACAGCCCCGTCTACGAAATCGGATACTCCTTCTTGGTACAGAACCCTGAGATTGATGTACTGGAAATAAAGGTCCTCGATCAAAAGACAGGAAACCAGCTAGGAATGCTGAGCTACGGCATATCAGCGCTTTTGAAAGAAAAAAATTTTACTATGTTGAATCAACCGATGAACCTACAAAAATCTGGCCCCGAATCTAAAATCATCATTGCGGCTCAATTGAAGATCCTTAAAGAGGCTGTCAAGGAAGAGGACTTTGATGAAGAAACAGTTTCCGTGGCAAGCGAACCATCGGATGACCGCACTGAGGACAAAAAACCGGATCCACCGAGCACGGAAACCACGGCTGTTGCACCAGCTGTTCCATCGAACACGGATCTGAAGAATATGGAAGACACTCCCCCAGCTACCGACACTATCGACAATAA
CTCCGAAAAATCTATCCCCGTCGAACAAATTATAAAGGAAGTTGATGTACCTCAAGAGAGTCAAGCGCCGTCTGAACGTGATTCACCAAAATTAATTCACAGGACCTCTTCAATAACGACATCAGCTGGCGATGGCCTCCACAGGCTCTTAGTCCTCGATCAAAAGACAGGAAACCAGCTAGGAATGCTGAGCTACGGTATATCAGCGCTTTTGAAAGAAAAAAATTTTACTATGTTGAATCAACCGATGAACCTACAAAAATCTGGCCCCGAATCTAAAATCATTATTGCGGCTCAATTGAAGATCCTTAAAGAGGCTGTCAAGGAAGAGGACTTTGATGAAGAAACAGTTTCCGTGGCAAGCGAACCATCGGATGACCGCACGGAGGACAAAAAACCGGATCCACCGAGCACGGAAACCACGGCTGTTGCAGCAGCTGTACCATCGAATACAGATCTGAAGAATATGGAAGACACTCCCCCAGCTACCGACACTATCGACAATAACTCCGAAAAATCTATCCCCGTCGAACAAATTATAAAGGAAGTTGATGTACCTCAAGAGAGTCAAGCGCCGTCTGAACGTGATTCACCAAAATTAATTCACAGGACCTCTTCAATAACGACATCAGCTGGCGAGGCTGGACTTGGGAGGATTCTGTTATCTCTACGTTACAGCATGCAGAATCAAACATTATATGTTGTTGTACACAAGATAATGAATATACCTCTCAAGGACCCCACCAATGTCCCGGACCCATATGTTAAACTATACCTGCTACCTGGTCGATCTAAGGATTCCAAACGCAAAACTGTGGTTGTGAAAGACAATTGTATGCCGGAGTATGACGAACAGTTTGAGTGGAGCATCCCGCTAGCTGAGCTTCACTCCAGACAGTTGGAGGTGACCGTCGCCACGCACAAAGGATTCCTCGGTGGAAGTCCTGTTATAGGACAGGTAATAGTTCACCTGAACCAGTATGACTTCCGGGAAGCAAAGACCCTTTGGTTTGATCTTCTGCCTGAAACTTCACCGAGAGAGTAG

Protein sequence:

>DPOGS204224-PA
MPVNSKFALPVSSDENLSVLSIIYRFFKKVSIVGAVYLVGYMQWSVAWLLGPVVLSVMRDQWRRDSEYRRNLAKTAALSSEKDIVLAKLDDLPAWVFFPDVERAEWLNRILLQVWPNVNHYARTLLKDTIEPAVAESLANFKLNGFKFERMILGTIAPRVGGVKVYDKNLSRDEIIMDVDLFYAGDCDISFVLQRIRGGIKDLQIHGMVRVVMKPLISKMPLVGGLQVFFLNNPSIDFNLVGAADVLDMPGFSDILRRCIVEQISRMMVLPNKLPIKLSDEIPTVDLRMPEPEGVLRIHLVQAQNLMKKDVSMLGKGKSDPYAIITVGAQQWKTKHIDNNINPRWEFWCEARIMQTLGQALDIEVFDKDEGNDDDKLGRFCFTNYIIYAIISSRKSQVLQCELWDWDPGMGIQNDDYLGRCSLDISQVVRAGRLDTWQTLQQAKTGKVHLRLSWHRFSTDLLDLSHALTSTQLVKNAELSSAVLSVYIDSCKHLPNARAQSRPDPYLVVTVGKKSENTGVQMRTDSPVYEIGYSFLVQNPEIDVLEIKVLDQKTGNQLGMLSYGISALLKEKNFTMLNQPMNLQKSGPESKIIIAAQLKILKEAVKEEDFDEETVSVASEPSDDRTEDKKPDPPSTETTAVAPAVPSNTDLKNMEDTPPATDTIDNNSEKSIPVEQIIKEVDVPQESQAPSERDSPKLIHRTSSITTSAGDGLHRLLVLDQKTGNQLGMLSYGISALLKEKNFTMLNQPMNLQKSGPESKIIIAAQLKILKEAVKEEDFDEETVSVASEPSDDRTEDKKPDPPSTETTAVAAAVPSNTDLKNMEDTPPATDTIDNNSEKSIPVEQIIKEVDVPQESQAPSERDSPKLIHRTSSITTSAGEAGLGRILLSLRYSMQNQTLYVVVHKIMNIPLKDPTNVPDPYVKLYLLPGRSKDSKRKTVVVKDNCMPEYDEQFEWSIPLAELHSRQLEVTVATHKGFLGGSPVIGQVIVHLNQYDFREAKTLWFDLLPETSPRE-
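
The reported lengths are consistent with a complete coding sequence: 1014 codons plus one stop codon gives 3 x 1015 = 3045 nt. The nucleotide record above is also wrapped across more than one line, which FASTA allows. The following is a minimal sketch of how one might read such wrapped records and check that relationship. The function names `read_fasta` and `cds_length_matches` are hypothetical helpers introduced here for illustration and are not part of the database, and the short record in the usage lines is toy data, not the sequence above.

```python
def read_fasta(text: str) -> dict[str, str]:
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records: dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:]
            records[name] = ""
        elif name is not None:
            records[name] += line  # wrapped lines belong to the current record
    return records


def cds_length_matches(transcript_nt: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_nt == 3 * (protein_aa + 1)


# Toy record (hypothetical), wrapped over three lines: Met, Asn, stop.
toy = read_fasta(">toy-TA\nATG\nAAC\nTAG\n")
print(len(toy["toy-TA"]))              # length of the joined sequence
print(cds_length_matches(3045, 1014))  # lengths reported for DPOGS204224
```

The check assumes the transcript contains only the coding sequence with its stop codon and no untranslated regions, which matches the lengths listed for this gene model.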