Monarch geneset OGS2.0

DPOGS209185
Transcript: DPOGS209185-TA, 4083 bp
Protein: DPOGS209185-PA, 1360 aa
Genomic position: DPSCF300061 + 257087-263300
RNAseq coverage: 186x (Rank: top 49%)
Annotation
Heliconius: HMEL003071, 0.0, 82.37%
Bombyx: BGIBMGA011538-TA, 0.0, 79.60%
Drosophila: Girdin-PA, 0.0, 42.59%
EBI UniRef50: UniRef50_Q7QCA4, 0.0, 44.29%, AGAP002527-PA n=3 Tax=Culicidae RepID=Q7QCA4_ANOGA
NCBI RefSeq: XP_001868808.1, 0.0, 45.18%, hook protein [Culex quinquefasciatus]
NCBI nr blastp: gi|170068285, 0.0, 45.18%, hook protein [Culex quinquefasciatus]
NCBI nr blastx: gi|157118249, 0.0, 45.99%, hook protein [Aedes aegypti]
Group
Gene Ontology: GO:0000226, 5.7e-20, microtubule cytoskeleton organization
Gene Ontology: GO:0008017, 5.7e-20, microtubule binding
Gene Ontology: GO:0005737, 5.7e-20, cytoplasm
KEGG pathway:
InterPro domain: [15-588] IPR008636, 5.7e-20, HOOK
Orthology group: MCL10999, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
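
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the database. It assumes the transcript is entirely coding (no UTRs) and ends in a stop codon, and that the genomic coordinates are 1-based and inclusive. The function names `cds_length` and `span_length` are hypothetical.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 4083
PROTEIN_AA = 1360
GENOMIC_START, GENOMIC_END = 257087, 263300


def cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of the given length.

    Assumption: one codon (3 bp) per residue, plus one stop codon if present.
    """
    return 3 * (protein_aa + (1 if has_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


expected_cds = cds_length(PROTEIN_AA)
genomic_span = span_length(GENOMIC_START, GENOMIC_END)

# 3 * (1360 + 1) = 4083, which matches the listed transcript length.
print(expected_cds == TRANSCRIPT_BP)
# Genomic span minus transcript length: bases not present in the transcript.
print(genomic_span - TRANSCRIPT_BP)
```

Under these assumptions the genomic interval (6214 bp) is longer than the transcript, which would be consistent with intronic sequence in the gene model.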

Nucleotide sequence:

>DPOGS209185-TA
ATGGCCGCATCTGCTACAGAGATTGACGATTTCTTAAGCGGACCTCTTGTTTCGTGGCTCAAATCATGTATCCCCAACCCAGACACAATACAGGAATATTCCTCCTTATTCAATGGAGATATTTTATATCACATTTATCTCCAAATAGACCCAGAGCCCTCCTTCCATATAACAAAGCTGACAGGTCTTGAAGACCAAGCTCTTATACTAGGAAGAGTGAAAAATTTTGATGCCATCATAAAAAATTTGAAAAATTTATATGACGAGGAGTTAGGGATGACACTTTTAGTTGTACCAGAATGTATATGCCTAGGGAAGGCACCGGAAACAAAGCAGGGTCTTGAAAATATGAAGCTGTTAGTTCTATTGCTGCTTGGTGCGGCTGTTCAATGTCCGAATAAAGAGATATTTATCACCAGAATCAAGGAGCTGGATGTAGACCTACAACATAACATTGTGCAATGCATAAAACAGGTAACGGACATGCAAACAGTAGTTCTGACACCGGATGCAATAGACCTCTTCCAATCTCCGACAATGTTTAATCATATGCGGAGATTGGCCAAGGAGAGAGATCATTATCTGCAGAACTGGGCGACTCTTGTGCTTAACGAGGACTTATGTGAAACTGAAAATGAGAATAAGAATGTCAATCGATCCACGCAGAACATCAGTCAGAACAACGGTGAAAGTCAACATTTGGCCGTAGAGCTGGCTGATTGGAAGGCAAGGTTGAGGAAACAAAGGCAGGAATTGGAGGAAAAATCGGAGCAGTTGTCTGAATGTAGAGAAGAATTGGAACATACAAAATTAGTCCTCGCGAAGTTGCGTACGGAGAGCCAAGAATGGTTCAATGAAGCGAGAAAAGCTACCGGATATAGAGACGAAGTGGACGCCCTGAGGGAGAAGGCTGAGAGATGTGATCGGTTAGAACAGGAAATCCAGAGATACCGGGACCGGTTGGCCGATGCGGAGTATTATAAGACGCGGGTTACAGAGCTGCGAGAGGATAACAAGGCTTTGATGGACACCAGGGACGCTTTGGAGGAACAGCTGCAAAGGGCGCGCAAGAGGGCAGAGCAGTGTCTGTCATTGGAAGCGGCCATGATAAAACTCAAGAGAGAAGCCAACGACATCGCTCTGGAAAGGGATGCTGATCAGCAGAAGATTCAAGAATTAATTGAAGAGAACAATCATTTGCAGTACATAACGAGATCGGTGTTGAGCGAAAGCAACAACAGTAACCTGGACACGGACAACGAGTGCGAGAACACCTTGGAGTCGGGAGAAAATAGTCTATCGGAACAATTGACAAGCAACGCCCAGGCCAGAGCGCTGAAACTCGAACTGGAAAATAAACGGCTGCTGTCGACTATAGACAGTCTGAGGGAACAATCGCTGCTAGAAGGCAGCGACAAAGTACTGGATCTAGAGAAAGAAAAGAAAAGACTAGCCCTTAAATGCGAACAATTACAAGAAAATTGCAACAGATTCAAGCAACAGAACTCTGAACTAGAAGAAGTGTTCAAGAACGCGCTGGAAGAGAACAGAAAACTGCAAGACTCTATCGACAGTCAGAAAGCTTTCATAGACAGACAATCTATAGATAGAGATTCAGAGAAAAACAAATTACAGGACTTCGAAAATCATTTGGAGTCATTGACCAAAGACAAGCAGAGGATGCAAATGCTATGCGAGTCAATACAGAGGAGAGCCGACGATCTCGAGAGGACTTTGGACGCTAGGACCAAGGAACTCAACAATATCAAACCTGAAGCAGAAAAGGCGACCTTACTGATAATACAAACAGAGGAATTGAAAACTAAGCTGACGTACAGCGAGAAGGAAAGCCACAGCTTACAGAGAGAAGTTAACAAATTACGTGAAGCGGTCGAGGAGAAAGACGTCATCATAGATAGGATAAGCACTGAGATAGAAATAAAGAATAAAGAAATAGAAAGATTGGCTAGACAGATAGAGATCAATCAGAATTTGAGTAG
TAGACTGCAAGACTTGGAACAGAAGACTCAGGAGTTGAAGTCACATAAGAAAGTGGACACGGAAACGATACAGATGCTGCAGAAAGATCTCATATCAGAGAAGGTTAATTTTGATAAATTAAGAAATTGTATGGAGAAGATCGGCATCAATACATCCGATGTCATATCCAAAGATGTCAGCGTGGAGGACTTGTTGGACAGAATAATAACCAACGTAGACCACGAGGCTTTAATATCTGAAATAACTTCAAAGACTATTTGCTCAAAATCGGTACCGTGTAACTGTCATAATTCGGAAAAACCGGAAGTCGACGAAAACGTTATTAACCCGCAGATCGAACAACTGAATGCGGACTTGGCAGCCCTACAAGTGTCGCTTGAGAATTGTCAAGCTGAAAACGCTAAGCTACAAGTTAACATAGCGACGTTAAATTCACAAAACGGCTCCCTCATCTCCCAACAAATGACGTTACAACTGGCCAACAGTCAGTTGGCGGCGGAGAAAGACGAAATCATGAAACAATTGGAAGTTTTAAAAGACAAACAAGATGATCTACTCCGAGACCAAGTGGCGCTGCAAACGTTACACGAACAGTTGAACATGGAATACGAAATGTTACTCAGTGAGAAGGAACCCGTTAAGGCCGCGATAAGAGATTTGAAGATCGAGAACAGGGAGCTGAAGGAAAAACTGACCGGCTACGAGAAAAAAATAGCGGATTTCGAATTAGAGAGAGAAAATCTTAAGATAGAGTCGAGAAATCTGACAAACCTTAGAGCTGAACACTCCAAGTTGAAGGAGGACTTTAGAAATTTGTTCACAGCGAGCGATAGGCTTAAAAACGAATACAGGAACATGCAAGACGAGTATAGGAATCTTAGAAGCGAAGTCTCCCAACTCAAACTAAGGAACACGGAAATGTCCGGAGAAATTAACACGAAAGTGGAAGTCATAACTAGTATGGAATTAGAAATAAGCAAAACAAATCAACACTGCGAGATGCTCATACTGATGAACAAAAACTTGGACGCCGACAGACGGTCCTTGATGGACCACGTGTCACAACTCCTGACCCAGTATCACGAGCTGTTGGCGCATTCGTTGAAGGACAAGCAGCATTATCACGAAGAAGAGAAGATGTTCGCCGACAGAGTGAATGCCTTATGTCGACAAAAGGAAAAACTAGAGGAGAAGATCATGGAACATTACAAAAAGTTGGACAACTGCACAACAAAGAGACGCGGCTTCGGCGCCTCGTTCGTGAAACGAGTGCGAAAAGCCGGCACTGATTTGATCAACAAAGTTCCGTCCAGAAACAAGAGGATAGAAGACGCCAACAGATCCAAGTCACAACTAACACTTGCTGGATCAGAATCCGGCGAATCAGATCCCGGACAGGATGTAGAAAAGTTGTCCAAACATTCAGATTACGATCAGGGATCTTCGAACCTGAATATGGAGTCACCGAGACATAGCGTAGATTCAAGCTTCAACAGACGCCTTGAGGATATATTCAAGAAGTCCGACAACATCGATATGAGTGGATCTATTCATAGTCTTGACCCCATCAGGTTGAATGAAGGTAATTTTGTAAGGAGACTGTCAGCGGCATCCATCCACAGCGGGGGTGGTGACGATGCGCTCATTAGGGCTTCTCTAAGACGACGACCGCATAAAGCCGTTCCGCCTACACATAGAAATAGTTTTCAAGGACTGGAACCCGATCCTAGCCTGCCTGCAGCTTCAAATCCTTCCCCGGTCTTTGGAACAGCAGGAACCCGCAGAACGGTTTACTTGGCCGAGGATAACCCAGAAGTTAACCTGAATAGCAAACCCCAGAGCACTCCGATCAAAGAGAATCCCACTTACTTGGTATACAATAGAATATCAACAGTCATAGGAGACGGTGCCTCTCAGAGTAACAGTGATAGATTAAATGACAGATCCGTGCCGGAACAATCCAGGAACACAGAAGAAAATCCAGTAGAGCGAGAAGCTT
CTAGAGATAAGTCCGTCAACCGCAAGGATGTCAATGAATCTTCTAAAGAAACTGCCATCTGGTACGAATATGGGTGCGTTTGA

Protein sequence:

>DPOGS209185-PA
MAASATEIDDFLSGPLVSWLKSCIPNPDTIQEYSSLFNGDILYHIYLQIDPEPSFHITKLTGLEDQALILGRVKNFDAIIKNLKNLYDEELGMTLLVVPECICLGKAPETKQGLENMKLLVLLLLGAAVQCPNKEIFITRIKELDVDLQHNIVQCIKQVTDMQTVVLTPDAIDLFQSPTMFNHMRRLAKERDHYLQNWATLVLNEDLCETENENKNVNRSTQNISQNNGESQHLAVELADWKARLRKQRQELEEKSEQLSECREELEHTKLVLAKLRTESQEWFNEARKATGYRDEVDALREKAERCDRLEQEIQRYRDRLADAEYYKTRVTELREDNKALMDTRDALEEQLQRARKRAEQCLSLEAAMIKLKREANDIALERDADQQKIQELIEENNHLQYITRSVLSESNNSNLDTDNECENTLESGENSLSEQLTSNAQARALKLELENKRLLSTIDSLREQSLLEGSDKVLDLEKEKKRLALKCEQLQENCNRFKQQNSELEEVFKNALEENRKLQDSIDSQKAFIDRQSIDRDSEKNKLQDFENHLESLTKDKQRMQMLCESIQRRADDLERTLDARTKELNNIKPEAEKATLLIIQTEELKTKLTYSEKESHSLQREVNKLREAVEEKDVIIDRISTEIEIKNKEIERLARQIEINQNLSSRLQDLEQKTQELKSHKKVDTETIQMLQKDLISEKVNFDKLRNCMEKIGINTSDVISKDVSVEDLLDRIITNVDHEALISEITSKTICSKSVPCNCHNSEKPEVDENVINPQIEQLNADLAALQVSLENCQAENAKLQVNIATLNSQNGSLISQQMTLQLANSQLAAEKDEIMKQLEVLKDKQDDLLRDQVALQTLHEQLNMEYEMLLSEKEPVKAAIRDLKIENRELKEKLTGYEKKIADFELERENLKIESRNLTNLRAEHSKLKEDFRNLFTASDRLKNEYRNMQDEYRNLRSEVSQLKLRNTEMSGEINTKVEVITSMELEISKTNQHCEMLILMNKNLDADRRSLMDHVSQLLTQYHELLAHSLKDKQHYHEEEKMFADRVNALCRQKEKLEEKIMEHYKKLDNCTTKRRGFGASFVKRVRKAGTDLINKVPSRNKRIEDANRSKSQLTLAGSESGESDPGQDVEKLSKHSDYDQGSSNLNMESPRHSVDSSFNRRLEDIFKKSDNIDMSGSIHSLDPIRLNEGNFVRRLSAASIHSGGGDDALIRASLRRRPHKAVPPTHRNSFQGLEPDPSLPAASNPSPVFGTAGTRRTVYLAEDNPEVNLNSKPQSTPIKENPTYLVYNRISTVIGDGASQSNSDRLNDRSVPEQSRNTEENPVEREASRDKSVNRKDVNESSKETAIWYEYGCV-
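
The nucleotide record above is wrapped over several lines, and the protein record ends in a `-` character. The following is a minimal sketch of how such FASTA records might be read and joined. It assumes the trailing `-` marks the stop codon, which the record does not state. `read_fasta` and `strip_stop` are hypothetical helper names, and the short `demo` records are made-up examples, not data from this page.

```python
from typing import Dict, Iterable


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records: Dict[str, str] = {}
    name = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:].split()[0]  # record ID is the first token
            records[name] = ""
        elif name is not None:
            records[name] += line  # continuation of the current sequence
    return records


def strip_stop(protein: str, stop_chars: str = "-*") -> str:
    """Remove trailing stop markers (assumed here to be '-' or '*')."""
    return protein.rstrip(stop_chars)


# Made-up example: a wrapped transcript and its protein.
example = [">demo-TA", "ATGGCC", "GCATGA", "", ">demo-PA", "MAA-"]
seqs = read_fasta(example)
protein = strip_stop(seqs["demo-PA"])
print(seqs["demo-TA"], protein)
```

Applied to the two records on this page, the same helpers would yield one joined transcript string and the protein without its terminal marker, ready for a length comparison.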