Monarch geneset OGS2.0

DPOGS216117
TranscriptDPOGS216117-TA1158 bp
ProteinDPOGS216117-PA385 aa
Genomic positionDPSCF300182 + 144682-148923
RNAseq coverage1777x (Rank: top 7%)
Annotation
HeliconiusHMEL0154469e-8387.10% 
BombyxBGIBMGA009282-TA2e-7281.68% 
DrosophilaSynd-PB1e-4549.04% 
EBI UniRef50UniRef50_E2C1C81e-4771.68%Protein kinase C and casein kinase substrate in neurons protein 2 n=5 Tax=Harpegnathos saltator RepID=E2C1C8_HARSA
NCBI RefSeqXP_001605519.13e-5456.41%PREDICTED: similar to membrane traffic protein [Nasonia vitripennis]
NCBI nr blastpgi|3800134522e-5359.18%PREDICTED: LOW QUALITY PROTEIN: protein kinase C and casein kinase substrate in neurons protein 2-like [Apis florea]
NCBI nr blastxgi|3071871135e-8145.57%Protein kinase C and casein kinase substrate in neurons protein 2 [Camponotus floridanus]
Group
Gene OntologyGO:00055158.3e-21protein binding
KEGG pathwaytca:6643241e-41 
 K10408 (DNAH)maps-> Huntington's disease
InterPro domain[328-384] IPR0014528.3e-21Src homology-3 domain
Orthology groupMCL10606 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216117-TA
ATGGCTGAACGAGTGTCGAAGTGTCGCGACGACGTGGCGAAGAATCGCGACAAGTACCAAGCGGCGCTGGCGGAGATAACGTCCTACAACCCTCGCTACATCGAGGACATGACCGGCGTGTTCGACAGGTGCCAGCAGATGGAGGCTCAGAGGCTGTCCTTCTTCAAGGACGTGCTGTTCAGCTTCCATAAGTGTCTGGATATAAGCAAGGAACCCACATTACCACAAATATACGAGGAGTTCCATCACACGATCAACAACGCCGATCACGGCAAGGACCTCAAGTGGTGGGCGAACAACCACGGCGTCAACATGGCTATGGCCTGGCCGCAGTTCGAGACAAATAAACCTTGTAAGGAGGATCCTCCGGAGTCTGAGCGGAAACAGTTCAGATACCTTCCCACGGCCGCCACCGCTATGATCGAAAGGATTGCCTCCGCGACCTCCAACCGACCAGCCATGAGAGACCTCTGGATCCGCACTCGAGCAGCAATCCGCAGAGCTCTACCCAGCATCTGTTACGGAGATGAGAGGGAATGTAGAGTAGCACCGTCAGACACCACCAGCTCGTCCAACGAGACAAGATGCTACAACGTAGCATTCATGGAGTACACGGAGGAGTTCCGGGACATCGCTAAGGGGAAGTCCAAGGAGAGTCTGCCCACGGGACCCATCACACTCCTCAACCAGAGACCCGTCAGCGAGGATGAACTTCCGCCGATAACGAACAACAAGTCCGGTAAAATAAACCACGCCGAGCCAACAACAAACACAACAGTCGTCAACAACAGCAGCGCCAACAACACCATCGACAAGAAGACGATCAGCGCACCGATAGCTGTCACAAACGGTACAGCATCAGTACCGAAATCAGCAAAAACATCACCGGCGAAGGACAGCGTGGGTAAAGACAACCCGTTCGAGGAGGAGGAGTGGGACGAGGACTCCGGGGGCGCGCTCACCGACACCGGCGAGCCCGGCGTGCCCGTGCGCGCGCTCTACGACTACACCGGCGCCGAGAGCGACGAGCTCAGCTTCAGACAGGGAGATTTATTCGAGAAGCTCGAAGACGAAGACGAGCAAGGCTGGTGTAAGGGAAGGAAAGACGGGCGAGTGGGGCTCTACCCCGCCAACTACGTGGAGCCCGTCGGAAACTAG

Protein sequence:

>DPOGS216117-PA
MAERVSKCRDDVAKNRDKYQAALAEITSYNPRYIEDMTGVFDRCQQMEAQRLSFFKDVLFSFHKCLDISKEPTLPQIYEEFHHTINNADHGKDLKWWANNHGVNMAMAWPQFETNKPCKEDPPESERKQFRYLPTAATAMIERIASATSNRPAMRDLWIRTRAAIRRALPSICYGDERECRVAPSDTTSSSNETRCYNVAFMEYTEEFRDIAKGKSKESLPTGPITLLNQRPVSEDELPPITNNKSGKINHAEPTTNTTVVNNSSANNTIDKKTISAPIAVTNGTASVPKSAKTSPAKDSVGKDNPFEEEEWDEDSGGALTDTGEPGVPVRALYDYTGAESDELSFRQGDLFEKLEDEDEQGWCKGRKDGRVGLYPANYVEPVGN-