Monarch geneset OGS2.0

DPOGS213638
TranscriptDPOGS213638-TA3327 bp
ProteinDPOGS213638-PA1108 aa
Genomic positionDPSCF300165 - 148388-162243
RNAseq coverage779x (Rank: top 17%)
Annotation
HeliconiusHMEL0045850.082.19% 
BombyxBGIBMGA004579-TA0.084.60% 
DrosophilaCaps-PD0.059.24% 
EBI UniRef50UniRef50_F4W7M30.066.58%Calcium-dependent secretion activator n=12 Tax=Pancrustacea RepID=F4W7M3_ACREC
NCBI RefSeqXP_972169.20.068.53%PREDICTED: similar to Calcium activated protein for secretion CG33653-PB [Tribolium castaneum]
NCBI nr blastpgi|2700149660.070.39%hypothetical protein TcasGA2_TC013590 [Tribolium castaneum]
NCBI nr blastxgi|2700149660.069.77%hypothetical protein TcasGA2_TC013590 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.8e-10protein binding
KEGG pathway 
InterPro domain[566-672] IPR0104393.6e-25Calcium-dependent secretion activator
[264-378] IPR0119931.8e-10Pleckstrin homology-type
[265-376] IPR0018497.4e-07Pleckstrin homology domain
Orthology groupMCL12011 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213638-TA
ATGACTGAGGTATATTTTAAAAAGTTGGACTCCGCGGACGAGCAGGCTGCTGCTATCAGAAGAGAACTGGATGGACGGATGCAGAAAGTCAATGAGATGGAGAAGAACCGCAAGTTGATGCCGAAGTTCGTGTTGAAAGAAATGGAGTCCCTGTACATCGAAGAGCTGAAGTCTTCCATCAATTTGTTGATGGCCAATTTAGAATCGCTCCCCGTGTCAAAGGGCGGCGCTGATTCGAAATACGGTCTTCATAAGATCAAGCGGTATAATCACAGATCGAATGCTGGGCCAGACCCGAGTCCAAGGTTGAGTAAGCAGCTATGTTTCAGATCGCAAGGGTCTCTAGCGAACAAACTGACTGGCGAAGGAGACGGCGGCGACGTGGACACCCAGCTCACCAAAATGGATGTAGTACTTACATTTCAAATAGAGGTAGTGGTTATGGAAGTTAAAGGTCTCAAGTCGCTGGCACCTAACAGAATAGTTTACTGCACGATGGAAGTGGAAGGAGGAGAAAAATTACAAACCGATCAGGCGGAAGCCTCGAAGCCAATGTGGGACACCCAGGGTGACTTTAGCACGACTCAGCCATTACCGGCGGTGAAGGTCAAGCTGTACACAGAGAACCCTGGAGTGCTGTCCCTGGAAGATAAAGAATTAGGCAAAGTTGTATTACGACCTACTCCCTTGTCTAGCAAGGCTCCGGAATGGCATCGCATGACCGTACCTAAGAACTTACCGGACCAGGATTTGAGAATAAAAATCGCTTGCCGAATGGATAAGCCTTTGAACATGAAGCATTGTGGCTACCTCCACGTTATCGGTAAGGCTGTCTGGAGGAAGTGGAAGCGGCGGTACATGGTGCTGGTACAAGTGAGCCAGTACACCTTCGCACTTTGTTCGTATAAGGATAAAAAGTCCGAGCCCGCGGAGATGATGCAACTCGATGGGTTCACCGTAGACTACATTGAACTGGCCAGTGCGCAACTGATGGTCGGCCAAGAATTGGAAGGCGCGAAGTACTTCATGAATGCGGTGCGTGATGGCGAGTCAGTGCTCATGGGAGTGGGAGATGAGAACGAATGTCACTTGTGGGTCATGGCCTTGTACAGGGCCACAGGACAAAGCCACAAGCCCACACCGCCCACTACATCCGACTCGCATATTAAAATTTTGGGGGATGCTGACAAAGCTCGCAAACACGGTATGGAGGACTACATTCAGGCGGACCCATGCCAGTTCGACCACCATCAACTCTTCACCACGTTACAATCTCTCACCCTGAAGTACCGCTTGCAGGATCCATACTGTTCTTTGGGCTGGTTTTCACCTGGGCAAGTGTTCGTGTTGGACGAGTACTGCGCTCGTTACGGCGTGCGTGGTTGTTACCGTCACCTCTGCTACCTGTCCGACCTCCTCGACGTCGCCGAGAGCGCCCAGCAGACCGTGGACCCCACTCTCATGCATTACTCCTTCGCATTCTGCGCCAGCCACGTTCATGGGAACAGTGTAAGCGACCCTTTCACAGTACGGCGTTGCCTGGCAGGCCCGATGGCGTCGGCAGCATCACGGTCCAGGATCTTAAATACTTTCTCTCCATACAGTGCCCATCCTTTGATAGGCGCCACTCTGTCTCTCCTAGAACGTGTCCTCATGAAGGACGTTGTGACACCGGTCGCTCCAGAAGAAGTCCGCAGTATGATACAGACCAGCTTAGAGAATGCGGCGCTACTTAACTATACTCAGCTCAGCCAGCAAGCTAATATTGAAGAGGATCTCCGTGGAGAGACCATGGTGACCCCGGCCAAGAAGTTAGAAGATCTGATACACCTGGCAGAGCTGTGTGTGGACCTGTTGCAGCAGAACGAAGAGCACTACGCGGAGGTTTATAACACAAAACCGGACTCTAGCGGTCAACAAAATGCGTTCGCTTGGTTCTCTGAGCTGCTGGTGGACCATGCAGAGATTTTCTGGGGTCTGTTCGCCGTAGACATGGATCGCGTGCTATCAGAGCAACCCCCGGATACTTGGGATTCATTCCCACTGTTCCAGATCCTAAATGACTATCTGAGGACCGATGAAAACTTGCGTAACGGTCGTTTTCATGAACATTTGCGTGACACCTTCGCGCCGCTGGTGGTCCGTTATGTGGACCTCATGGAGTCATCCATCGCTCAATCCCTCCACAAAGGATTCGAAAAGGAGCGCTGGGAGATCAAAGGTAATGGTTGCTCGACCAGCGAGGACTTGTTCTGGAAGTTGGACGCGTTGCAGTCCTTCATCAGGGACCTGCACTGGCCGGAACCAGAATTCAGATCTCACCTGGAGCAAAGGCTGAAGCTGATGGCGAGCGACATGATGGAAACCTGCATACAAAGAACTGAAGCTTCTTTTCAGGCGTGGTTGAAGAAGAGTGTGACCTTCATGTCGACGGACTACATCCTGCCGGCCGAGATCTGCGCCATGGTGAACGTGGCCTTAGACGCTAAGAACCAGGCGCTCAAGCTGTGCGCCGTGGAAGGAGTAGACATCTCTGGTATCGTGTTGAACGACGACACTTTTCTAACATCCTGTATAGTGTTAGCGTGTTTGTCCGAGCATGGCATGACGCAAGCAATGGCAGCGCGCACGCAACCATACCAGTACCATGCTAAAATGGATGCTCAGATCGAGGCCTGTCTGCACGCGATGTCCACGGGGATGACAACGAGGTTGTCGGCCGTGCTGGACGCCACCCTCGCCAAGATCGCGCGCTTCGACGAGGGCAGTTTGATAGGGTCGATCCTGACGATCGCGAATGTCTCCGGGTCAGGGAAGGACATTGGCCAGGGTTACGTGAACTTTATGAGGAACTCCATGGACCAAATCCGATCGAAGGTGACAGACGAGCTCTGGATCCTGCAGCTGTTTGAGCAGTGGTACTCAGTCCAGATAGGCGCCATCTCCATGTGGCTCGGGGAGCGCCCGGCCCTCCATCCTCAGCAGGTCGCATGTCTCTCCATCGTTCTCAAGAAAATGTACAGCGACTTCGAGCTCCAGGGTGTGATTGATGACAAGCTGAACTCCAAGCAGTACCAGGCGGTGGCGGGCAGGATGCACACCGAGGAGGCCACCTGCAGTCTACTGCTGGCTCAACAGGACCAGGGAAGCGGAGACGACGGAGGGTCTGAGGAGGGAACCAGGAGCGGCAAGTCAAAGCTGGAGGCTCTCACAGAAGACGCCAAGCTAGGAAACGTGACGGCTGTTGTTGGAAAGATGGGCAACATGTTCGGGCGAGGTATCGGCGAGCTCTCCACCAAGCTGGGAGGCGCCTCTTCCTGGTTCTAG

Protein sequence:

>DPOGS213638-PA
MTEVYFKKLDSADEQAAAIRRELDGRMQKVNEMEKNRKLMPKFVLKEMESLYIEELKSSINLLMANLESLPVSKGGADSKYGLHKIKRYNHRSNAGPDPSPRLSKQLCFRSQGSLANKLTGEGDGGDVDTQLTKMDVVLTFQIEVVVMEVKGLKSLAPNRIVYCTMEVEGGEKLQTDQAEASKPMWDTQGDFSTTQPLPAVKVKLYTENPGVLSLEDKELGKVVLRPTPLSSKAPEWHRMTVPKNLPDQDLRIKIACRMDKPLNMKHCGYLHVIGKAVWRKWKRRYMVLVQVSQYTFALCSYKDKKSEPAEMMQLDGFTVDYIELASAQLMVGQELEGAKYFMNAVRDGESVLMGVGDENECHLWVMALYRATGQSHKPTPPTTSDSHIKILGDADKARKHGMEDYIQADPCQFDHHQLFTTLQSLTLKYRLQDPYCSLGWFSPGQVFVLDEYCARYGVRGCYRHLCYLSDLLDVAESAQQTVDPTLMHYSFAFCASHVHGNSVSDPFTVRRCLAGPMASAASRSRILNTFSPYSAHPLIGATLSLLERVLMKDVVTPVAPEEVRSMIQTSLENAALLNYTQLSQQANIEEDLRGETMVTPAKKLEDLIHLAELCVDLLQQNEEHYAEVYNTKPDSSGQQNAFAWFSELLVDHAEIFWGLFAVDMDRVLSEQPPDTWDSFPLFQILNDYLRTDENLRNGRFHEHLRDTFAPLVVRYVDLMESSIAQSLHKGFEKERWEIKGNGCSTSEDLFWKLDALQSFIRDLHWPEPEFRSHLEQRLKLMASDMMETCIQRTEASFQAWLKKSVTFMSTDYILPAEICAMVNVALDAKNQALKLCAVEGVDISGIVLNDDTFLTSCIVLACLSEHGMTQAMAARTQPYQYHAKMDAQIEACLHAMSTGMTTRLSAVLDATLAKIARFDEGSLIGSILTIANVSGSGKDIGQGYVNFMRNSMDQIRSKVTDELWILQLFEQWYSVQIGAISMWLGERPALHPQQVACLSIVLKKMYSDFELQGVIDDKLNSKQYQAVAGRMHTEEATCSLLLAQQDQGSGDDGGSEEGTRSGKSKLEALTEDAKLGNVTAVVGKMGNMFGRGIGELSTKLGGASSWF-