Monarch geneset OGS2.0

DPOGS207840
Transcript         DPOGS207840-TA    4185 bp
Protein            DPOGS207840-PA    1394 aa
Genomic position   DPSCF300042 + 1102254-1115942
RNAseq coverage    635x (Rank: top 20%)
Annotation
Heliconius        HMEL015308          0.0       62.13%
Bombyx            BGIBMGA005521-TA    0.0       53.28%
Drosophila        Hip1-PA             2e-95     39.55%
EBI UniRef50      UniRef50_E0VES6     6e-140    47.03%    Huntingtin-interacting protein, putative n=10 Tax=Pancrustacea RepID=E0VES6_PEDHC
NCBI RefSeq       XP_001604846.1      1e-144    42.86%    PREDICTED: similar to huntingtin interacting protein 1 [Nasonia vitripennis]
NCBI nr blastp    gi|345481802        1e-143    42.95%    PREDICTED: huntingtin-interacting protein 1 isoform 2 [Nasonia vitripennis]
NCBI nr blastx    gi|345481802        1e-141    37.50%    PREDICTED: huntingtin-interacting protein 1 isoform 2 [Nasonia vitripennis]
Group
Gene Ontology      GO:0003779     5.4e-63    actin binding
                   GO:0005543     1.4e-62    phospholipid binding
KEGG pathway       tca:660136     2e-131
                   K04559 (HIP1)  maps-> Huntington's disease
InterPro domain    [1187-1381]    IPR002558    5.4e-63    I/LWEQ
                   [58-319]       IPR011417    1.4e-62    ANTH
                   [54-176]       IPR013809    6e-31      Epsin-like, N-terminal
                   [59-171]       IPR008942    1.1e-23    ENTH/VHS
Orthology group    MCL10862       Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207840-TA
ATGGCAAATCCCGCTGAAAAACGCTTCTATCAACTGAACGCTACGGTTGTTGACATCGGCTTTGTTTACAAACGCCATTCTAATAATATGGCGCGCGAACAACGCAACGAAACGCCGTGTTCTAAATTTGAAATGAGTTTCATACGCTCACGCGTACATCGTATGAACACTTTAGCAATTCAAAAGGCTATAAATTCTATAGAGACGCCGGTTAAAGAGAAACATGTCCGCAGCACAATCATTGGGACGTTCCAGGAACAGAGCGCCGTCACATACTGGATGGTGGCCGTCCGTCTTCCTCTTCAGGACAACAGGATCGTGTCCTGGAAGTTCTGCCACGTCACACACAAGCTGCTCCGCGAGGGTCACCCCGCCTGCCTCGATGACTCGCAGCGGCACATTAACATGATTGAGAATTTGGGAAAGCTATGGGTTCACCTGCGCGAAGGTTACGGACGTCTAATAAGCTTGTATTGCAAGCTGTTGGTAACCAAATTGAAGTTCCATTCACGCAACCCTCGCTTCCCTGGCAACATGCAGCTCACGGTGGAAGAGTTGGACGCTATAGCCGAGAATGACGTCAACAATTACTTCCAGCTGTGCGTTGAACTTTTCGACTACATGGACGACATTCTTCAGCTGCAAGCAGCTGTGTTCGACAGCCTGGGCAATGCTAGAGCTAACTCTATGACAGCCAGCGGTCAGTGCCGGCTGGCGGCCCTCATCCCCTGCGCCCAGGACTCCTCGCACATATACGACTGTATCGTCAGGCAGATGTTCAGACTGCACGCCTCGCTACCGCCCGACGTGCTCGACGGCCACAGAGAGAGATTCAGGACACAATTCAAGAAGTTGAGTTCATTCTACAAGCACGCGAGCAGCTTACAGTACTACAGAAATCTTTTAACATTGCCAGTACTGCCTTCCAACCCGCCAAACTTTCTGCAACAGAGTGATTTCGGTACTTACGTCACGCCGGTGGTAACAATCCCTGACCTGCCCCCGGACGAGGTGGATGCTGTCGGGTCACTCATAGATACTTCTGACCCGGTCACTCAGGGCCAGATGCACAGCGAGGTGTCTGATACTCGCAGCACCAACTCGCCAGTTCCAGATCCCATTGTTGAGAGAGACCGACTCATAGAACACTTACAACATGAACTTAAAAGGATGAGATCTGAAGTGTCGCAGATAATACAAGAGAGGAACACAATGCTGGGGTCTATGAGAGAGCATTGTACACGGTTGGAGGCACAGATGCAAAATGTTAAGTCTGAATTGGACGATGAGAGAAATAAGGCTGAAAGTCTGGCGGCTGAGACACCGGAAATCAAAAAGAAATTAATGGACACCGAGGATAAGGCGAAGGTCATAGATGAGAAGTTTCAGAAATTAAAAGGCGCTTACACACAGCTGAGGGAAGAACACATAACGTTGATAAGACAGAAAGCCGAGGTCGACAAGCTCGCGTCCAAGCTGCGAGGCGAGGCCGCTCAGCACGAGGGCGCCACTAGCCTGTTACAGCAACAACTCAACGATCGAATGAAAGATGTCGAGCTTCTGCAACAGAAGGCATCGTCTAGTGAAGAAGTCGAAGCTTACAAAACCGAACTCACGAACCTCAGGACTGAGTTGGAACAGACCAGGCAGAAAGAAGTGGAACTACAAACACTGAGAACCAACTTCGAGGCTCTGGAGATTGAACACAACACAGTCAAAACTGTACAACAAGACAAACTGACAGCTCTCACTAACGATTTGAAAGAGACCAACGAGAGTTTAGAGAAGTTAAAGGCGGACTTCGAGGAGAAGGACAAAGAGCTGAGTAGAGTTAAAGAGGAATTGACAGCGGTGTTACAGAAGAGCGGTGACGAATATAAAACGGCGATTAAAGATAAAGAAGAGGCTTTGAAGCAGTTGGCCGAGATGAAAGCACAGTATCAAGAGGAAAGAGAGGAACGCATTATCCAAACTAACAATTTGCAAGCTGAATTAGAATA
TATCAAAGCAAAATTAACCGACACCCAGAATAATTTCGATTCCCAATGTCGCAAACTGAACGGAGAACTGAATTCGAAAAATGAGGAGCTGCAAGCTATACTGGATAAGAGAGATGCAGAAGCAAACGAGGCCATAGATAAACTCAGTTCGTTACAAAAACAGATAGAAACCTATCGAAACGATATCGAAACATCGAAGGCGACCATAGACAAACAGAATGAAGAAATAGACGAACTCAACAACAAAATAAACACACTCGAGGAAGATAAAGAGAATTCGATGATAGAGTTTGAAAATATGAATACGCTTAATAATATGTTAGACGAACAATTACAGCAGGAGACACACAAGAGAAACTTGCTTGACAAGGAAATATCAGAGAAAATAGTGGAAAATAAATCTAGACTACGTAGCAAAGATGATGATATAGAAACACTTAAAACCGACATAGAAAAGCTGTTCGTGGAAAGAGACGAAACATTACACGAGAGAAATAATTTACTATCAATAAATCAGGGACTACAAAAATCAATTAAGGAACTACAGAATAAGGAAAATGAATTAACAGCTGAGATTGAAAAGGCGAGAAGAGAAAACGTCACCGTAAGGAATACATTGCAAAGCGAAATAGATAATCTACGGTCCGCGTGTGTTACTATGGAAATAAGTAGAGATAATGTCGTGAGGGAACTGACGGATGAGTTTGGATTAAAAGAGGTCGAATTGAAGAGGGTCATTGCAGACAAAGACGCGGTGTTGGCCAGAGCTAACAAGGACCTGGAGACATTGAAAACTGAGGTTGCAAAGTTGACAGAGATCCAAACTGAAGAGAGGCAAGCGTTGACTAGATCTATGTCTGAGAGAGAGAAAAATATACAGTTGACCGAGGCTAGATTAGAGCACACGGAGAGCGAACGACTCACCGCGGAATGTGAATTACAGGACCTCTTGCAGCAAAACACTATCATGGAGGCCGATCTGATGTCCTTAAAGATACAGCTGGACGAGAAGGACCAGCTCATCAAGAAACAAGCTAGTAAGATATTGGCGTGTGCGACTGAAGCTGCGTTGGATATAACGAACGAGGCGATCTCGGCGTTTGAGAACTCAAACGCTCAAGATACTAACAAAAGAGCCGGCGAACACGCGGCTAAGGCTTTCGAAACAATCGCCAGGAAGCATAAGTTGGAGGGTAACGAGGAGCTGGTGTCGAGGAGCGTCCTCTCAGCGGCACACAACACGGCCAGGGTGTCGTACATCGTGTCTGACGTCACCAACACCACAACTGATATGGAATTAGCTGAAAAACTGAGCAACGACTGCCGGACGATGCTGGCGAACACGAAGCAATGTCTAGAGAATATATCGACGGGTGCCATAGACGTGACGCAGTGTCTGGCGACCGGAGCCAGCGTCACCAGGCTGTCGAGAGCCGCGGCCGATGACGTCACCGACACGAGGGGCGTGGACGACGAGCTCGCCGACATGGACAGGGCCATAGAAGTAGCGGCGAAACAGATTGAGGACATGTTGGCTGCAAGTCGAGCTGGTGACACGGGCGTCAAGTTGGAAGTGAATGGCAAGATCTTGGACGCGTGTACGACCCTCATGGCTGCCGTCAAAGTGTTGGTCCAGGACTCCAGGAAGCTTCAAAATGAACTCGGGGACCCGAAGACACGACAGAACATGTACCGCAGGAACCCTCAGTGGTCGGAGGGCCTGATATCCGCTTCCAAAGCTGTGGTCTTCGCGGCTAAGTTGCTTGTTTCATCCGCGGACGAGGCTGTGGGTGCGGCTGGTCGGGTTGAAGGTGTGTCAGCGGCGGCCCATGAGGTGGCGGGGAGCACGGCACAGCTGGTGGCCGCTTCGAGGGCGAAGGCTCCGCCCGCGACACCCGCGCTCGCGAGACTCACCGCCGCCTCCAGAGCTGTGGCCGCGGCAACCGGCGCTGTAGTGGCCGCGGTCAGAGGCGCCTCCGCATTAGTTAGAGATCAGGAAA
CCCTGGATACTTCGAATCTTTCACTAACTGCTACCCGGAGACTGGAGATGGAAAGCAAAGTACGTGCTCTGGAACTGGAGAGTGCTTTAGACGCTGAGAGGAACCGGCTGGCTGCTCTCAGGAAGAGACACTACAACCTCGCACAGATGCACGAGAACGGAACTATAACAAACGGAGATGAATGA

Protein sequence:

>DPOGS207840-PA
MANPAEKRFYQLNATVVDIGFVYKRHSNNMAREQRNETPCSKFEMSFIRSRVHRMNTLAIQKAINSIETPVKEKHVRSTIIGTFQEQSAVTYWMVAVRLPLQDNRIVSWKFCHVTHKLLREGHPACLDDSQRHINMIENLGKLWVHLREGYGRLISLYCKLLVTKLKFHSRNPRFPGNMQLTVEELDAIAENDVNNYFQLCVELFDYMDDILQLQAAVFDSLGNARANSMTASGQCRLAALIPCAQDSSHIYDCIVRQMFRLHASLPPDVLDGHRERFRTQFKKLSSFYKHASSLQYYRNLLTLPVLPSNPPNFLQQSDFGTYVTPVVTIPDLPPDEVDAVGSLIDTSDPVTQGQMHSEVSDTRSTNSPVPDPIVERDRLIEHLQHELKRMRSEVSQIIQERNTMLGSMREHCTRLEAQMQNVKSELDDERNKAESLAAETPEIKKKLMDTEDKAKVIDEKFQKLKGAYTQLREEHITLIRQKAEVDKLASKLRGEAAQHEGATSLLQQQLNDRMKDVELLQQKASSSEEVEAYKTELTNLRTELEQTRQKEVELQTLRTNFEALEIEHNTVKTVQQDKLTALTNDLKETNESLEKLKADFEEKDKELSRVKEELTAVLQKSGDEYKTAIKDKEEALKQLAEMKAQYQEEREERIIQTNNLQAELEYIKAKLTDTQNNFDSQCRKLNGELNSKNEELQAILDKRDAEANEAIDKLSSLQKQIETYRNDIETSKATIDKQNEEIDELNNKINTLEEDKENSMIEFENMNTLNNMLDEQLQQETHKRNLLDKEISEKIVENKSRLRSKDDDIETLKTDIEKLFVERDETLHERNNLLSINQGLQKSIKELQNKENELTAEIEKARRENVTVRNTLQSEIDNLRSACVTMEISRDNVVRELTDEFGLKEVELKRVIADKDAVLARANKDLETLKTEVAKLTEIQTEERQALTRSMSEREKNIQLTEARLEHTESERLTAECELQDLLQQNTIMEADLMSLKIQLDEKDQLIKKQASKILACATEAALDITNEAISAFENSNAQDTNKRAGEHAAKAFETIARKHKLEGNEELVSRSVLSAAHNTARVSYIVSDVTNTTTDMELAEKLSNDCRTMLANTKQCLENISTGAIDVTQCLATGASVTRLSRAAADDVTDTRGVDDELADMDRAIEVAAKQIEDMLAASRAGDTGVKLEVNGKILDACTTLMAAVKVLVQDSRKLQNELGDPKTRQNMYRRNPQWSEGLISASKAVVFAAKLLVSSADEAVGAAGRVEGVSAAAHEVAGSTAQLVAASRAKAPPATPALARLTAASRAVAAATGAVVAAVRGASALVRDQETLDTSNLSLTATRRLEMESKVRALELESALDAERNRLAALRKRHYNLAQMHENGTITNGDE-
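
The transcript and protein lengths listed in this record (4185 bp and 1394 aa) fit a coding sequence of 1394 codons plus one stop codon, and the trailing "-" in the protein sequence appears to mark that stop. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) and assuming that "-" is this record's stop symbol. The function names `translate` and `coding_length_bp` are hypothetical helpers introduced for illustration, not part of any MonarchBase tool.

```python
# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0; stops are written as stop_symbol."""
    seq = nucleotides.upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[c].replace("*", stop_symbol) for c in codons)


def coding_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of protein_aa residues."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Lengths as stated in the record: 1394 aa plus a stop codon gives 4185 bp.
print(coding_length_bp(1394))

# First and last 15 nucleotides of DPOGS207840-TA as listed above.
print(translate("ATGGCAAATCCCGCT"))
print(translate("AACGGAGATGAATGA"))
```

The two short fragments are the opening and closing codons of the nucleotide sequence above; translating the full sequence the same way requires joining its wrapped lines into one string first.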