Monarch geneset OGS2.0

DPOGS203987
Transcript: DPOGS203987-TA  3567 bp
Protein: DPOGS203987-PA  1188 aa
Genomic position: DPSCF300005 + 1254760-1270460
RNAseq coverage: 3x (Rank: top 90%)
Annotation
Heliconius: HMEL012075  0.0  74.21%
Bombyx: BGIBMGA002131-TA  0.0  77.41%
Drosophila: TrpA1-PE  0.0  62.53%
EBI UniRef50: UniRef50_B4HK42  0.0  62.35%  GM24910 n=10 Tax=Endopterygota RepID=B4HK42_DROSE
NCBI RefSeq: XP_001956862.1  0.0  64.32%  GF10143 [Drosophila ananassae]
NCBI nr blastp: gi|270004805  0.0  66.81%  hypothetical protein TcasGA2_TC002449 [Tribolium castaneum]
NCBI nr blastx: gi|270004805  0.0  66.81%  hypothetical protein TcasGA2_TC002449 [Tribolium castaneum]
Group
Gene Ontology: GO:0016020  1.4e-07  membrane
               GO:0055085  1.4e-07  transmembrane transport
               GO:0005216  1.4e-07  ion channel activity
               GO:0006811  1.4e-07  ion transport
               GO:0005515  1.2e-05  protein binding
KEGG pathway 
InterPro domain: [33-390]  IPR020683  6.3e-53  Ankyrin repeat-containing domain
                 [841-1019]  IPR005821  1.4e-07  Ion transport
Orthology group: MCL16863  Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203987-TA
ATGAAGATGGATCCCGGTGGAGAGTTGCAGGTGATGCTGCCACATCAAATTCAAGACAGAATAACCTCCGGTGATGTTTGTAGAATGTCTGACAGCCCATATCGAATACACAAAGCGGCTGAAAGTGGAAATGTGGAAGATTTTATGCGTCTGTACCTAACGGAACCATCCCGAATATCCATCCGCGACTCTAATGGCCGCACTGCAGCTCATCAAGCAGCTGCTAAAAACCACACAAACATCTTGCACTCCATTAATAAATACGGCGGAGCATTAGATATAGCAGACAACGCCGGGAATACGCCGCTTCACTTAGCAGTAGAAAATGAGTCACTAGATGCCATAGATTTTCTACTTCAGCAGAATGTAGATACTTCAAGTTTGAATGAGAAACGTCAGGCACCCATCCATTTAGCCACAGAATTAAATAAAGTATCTGTACTAAAAGTTTTTGTAAAACATAAAACTAAGTTTGACGTTGATATCGAGGGGGAACATGGTCGCACAGCTTTGCATTTCGCTGCTATTCACGACCACGATATGTGTGCTAGAATATTGATTTCTGAATTGGGAGCCCAATGTAAACTTCAATGTAATAATGGGTACTATCCTATTCATGAAGCTGCGAAGAATGCTTCCTCGCGTACTATGGAAGTCTTTCTTCAATGGGGTGAATCTGAAGGTTGTACAAGGGAAAAGATGATGTCCTTACATGACAACGAAGGGAACGTTCCTCTTCATTCTGCCGTACATGGAGGTGACATCAGGGCTGTGGAGCTCTGTTTGAGATCTGGTGCTAAGATATCAGAACAACAATACGATTTTTCCACGCCCGTACATCTAGCCTGTGCTCAGGGCGCTCTAGAAATAGTAAAGTTGATGTTTACAATGCAACCAGAAGAGAAAATGGCATGTTTAATGTCCTGCGATGTACAGGAAATGACGCCTTTACATTGTGCCGCAATGTTCGATCATCCTGAAATTGTTAAGTATCTTGTGAATGAAGGATCTGATCTGAATCCTTTGGATAAGGAAAAAAGGTCTCCTCTACTGTTGTCTGCCTCACGAGGCGGTTGGAGAACTGTTCATACATTTATTCTTCTCGGTGCGAATATGGAACTGAAAGACATAAATTCTCGGAACGTGCTTCATCACGTTGTTATGAACGGAGGTCGTCTAGAAGATTTTGCAACAACTTGCAAAAATCGATGCGAAAAAAGTCTTTCACAATTACTAAATGAAAAAGACAATAACGGTTGTTCGCCCCTACATTATGCCAGTCGAGAAGGCCACATAAGGTCCTTAGAAAATCTCATAAAGCTCGGTGCCTGTATCAATCTCAAGAACAATAATAACGAGAGTCCACTGCACTTTGCCGCGAGATATGGGCGATATCACACAGCGTGCCAGCTATTAGATTCTGATAAGGGAACTTTTATCATAAATGAAAGTGATGGGGAAGGACTCACACCATTGCATATCGCATCACGGGAAGGTCATACGAGAGTGGTACAATTGTTACTCAATCGAGGAGCTTTGCTGCATAGAGATCATAATGGACGCAACCCACTTCATCTTGCAGCAATGAGTGGATATACTCAGACCGTAGAACTTTTGCATTCAGTTCACTCACATTTGCTGGATCAGACTGATAAAGATGGAAATACACCACTTCACTTAGCTACAATGGAAAATAAGCCTAACTCTATAGCATTGCTGTTATCTATGGGCTGTCAGTTGAGTTACAACTCGTTGGAAATGAGTGCTATCGACTATGCAATTCATTACAAATTCCCAGAGGCTGCTCTAGCAATGGTGACTCATGAAGAACGAGCTAAGGAAGTTATGGCTTTACGATCAGATCGTCACCCTTGTGTCACTCTCGCACTGATAGCTTACATGCCTCGAGTGTTTGAAGCAGTTCAAGATAAGTGTATAACAAAAGCTAACTGTAAAAAGGATTCTAAAAGTTTCTATATCAAGTACAGCTTCGAAACGCT
ATGTCCACAAACGATAAACGATGACTGCAACAGAATAGCAGATACTGCATTGAAATCTCAGAAAAATCCTCTGTCTGCGTTAAACATTAAGTATTCTTTCAAATTCTACCAACACTCGAGGCTTGAGATAGATGCGCTGAGACAGGCGCTGAACGACCCAAAGTTTCGACCAGAACCGTTGAGTGTTATCAACGCAATGGTAGCTCACGGTCGCGTTGAATTGTTAGCTCATCCTTTGAGCCAAAAGTATCTTCAAATGAAATGGAATTCTTATGGAAAATATGTCCACCTACTGAATCTCTTAATATACTGCATAATCCTCCTATTAGTGACGCTGTATATTTATTTTTTGATGACCAAAATGCCTAAGAATATGAATACAGCAAGCCATGTTAGCAATGAGACTGCATATGTATTTAATTCTCTATTAAATGACACTGCTACTACTGGTTTCAAGCAAGATTTCGGAATATATGCTAGTGCAGGGATTATATTAACTTATAACTTCATTTGTGTGATACGCGAAATGTACAATATAAAGGAACAAAAATGGCATTATATTGTTGACCTCTCGAATTTTGTTTCCTGGATGCTGTACGTCAGTTCAACACTCATGACTTTGCCATCTATATACCCAATTTATCAAAATATTCAGTCTTCAGCTGCTTCAATTACCGCCTTCTTAGCATGGTTCAAATTGCTACTACTACTTCAACGCTTCGATCAAGTTGGGATCTACGTGGTTATGTTTTTGGAGATACTACAAACACTTATTAAAGTTTTGATGGTGTTTTCAATACTAATAATAGCATTCGGACTAGCCTTCTTCGTCCTTTTATCTAATGGCCAGCATTTATCATTCAGCAGCATACCGATGTCATTGATGAGAACATTCACGATGATGTTGGGAGAGATTGACTTTGTTGGTACTTATGTCCAACCCTACTATAAGACAGAAATAGACGTTCTCTTACCTTTCCCTATTCCAACTTTCTTTATCCTCGGCCTGTTTATGGTCTTTATGCCAATCCTATTGATGAACTTACTCATTGGTTTGGCTGTCGGAGACATTGAGAGCGTCAGGCGAAATGCGCAGCTAAAACGCTTGGCTATGCAAGTAGTTTTACATACAGAATTAGAACGAAAATTGCCTGCTTGCATTTTGGAAAATGTTGACAAAGATGAACTGATTGAATACCCAAACAACAACAAATGCAAACTCGGTTTCCTTGACTTAATTCTACGGAAATGGTTCTGCAATCCATTCACTGACGACGCTGAATCTGTTTCACAAGCCGTCGGTCTTGACTTGGTCCTGGAGAGTAAAGAAGATTACATGACTGCTGAAATGGACAAGCAGAAGAGACGTTTACGCGAAATGTCACAATTACTTGAACAACAGCATACTCTTATTCGTCTCATTGTGCAGAAGATGGAAATAAAAACTGAAGCTGATGATGTCGACGAAGGGGTTTCTCCAGCGGAAACGCGTGTTGTACCGAGGTGGAGTACACCACGCATCAGAAAACAACTTCACACCGCTGCGTCATTTAATAAAGGGATCTAA

Protein sequence:

>DPOGS203987-PA
MKMDPGGELQVMLPHQIQDRITSGDVCRMSDSPYRIHKAAESGNVEDFMRLYLTEPSRISIRDSNGRTAAHQAAAKNHTNILHSINKYGGALDIADNAGNTPLHLAVENESLDAIDFLLQQNVDTSSLNEKRQAPIHLATELNKVSVLKVFVKHKTKFDVDIEGEHGRTALHFAAIHDHDMCARILISELGAQCKLQCNNGYYPIHEAAKNASSRTMEVFLQWGESEGCTREKMMSLHDNEGNVPLHSAVHGGDIRAVELCLRSGAKISEQQYDFSTPVHLACAQGALEIVKLMFTMQPEEKMACLMSCDVQEMTPLHCAAMFDHPEIVKYLVNEGSDLNPLDKEKRSPLLLSASRGGWRTVHTFILLGANMELKDINSRNVLHHVVMNGGRLEDFATTCKNRCEKSLSQLLNEKDNNGCSPLHYASREGHIRSLENLIKLGACINLKNNNNESPLHFAARYGRYHTACQLLDSDKGTFIINESDGEGLTPLHIASREGHTRVVQLLLNRGALLHRDHNGRNPLHLAAMSGYTQTVELLHSVHSHLLDQTDKDGNTPLHLATMENKPNSIALLLSMGCQLSYNSLEMSAIDYAIHYKFPEAALAMVTHEERAKEVMALRSDRHPCVTLALIAYMPRVFEAVQDKCITKANCKKDSKSFYIKYSFETLCPQTINDDCNRIADTALKSQKNPLSALNIKYSFKFYQHSRLEIDALRQALNDPKFRPEPLSVINAMVAHGRVELLAHPLSQKYLQMKWNSYGKYVHLLNLLIYCIILLLVTLYIYFLMTKMPKNMNTASHVSNETAYVFNSLLNDTATTGFKQDFGIYASAGIILTYNFICVIREMYNIKEQKWHYIVDLSNFVSWMLYVSSTLMTLPSIYPIYQNIQSSAASITAFLAWFKLLLLLQRFDQVGIYVVMFLEILQTLIKVLMVFSILIIAFGLAFFVLLSNGQHLSFSSIPMSLMRTFTMMLGEIDFVGTYVQPYYKTEIDVLLPFPIPTFFILGLFMVFMPILLMNLLIGLAVGDIESVRRNAQLKRLAMQVVLHTELERKLPACILENVDKDELIEYPNNNKCKLGFLDLILRKWFCNPFTDDAESVSQAVGLDLVLESKEDYMTAEMDKQKRRLREMSQLLEQQHTLIRLIVQKMEIKTEADDVDEGVSPAETRVVPRWSTPRIRKQLHTAASFNKGI-
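
Consistency check (illustrative sketch, not part of the original record): the transcript length (3567 bp) and protein length (1188 aa) listed above agree if the final codon is a stop, which the protein record writes as a trailing "-". The Python snippet below is a minimal sketch of that check. It assumes the standard genetic code, and the function name `translate` is a hypothetical helper introduced here for illustration only. It translates the first 30 and last 15 nucleotides of DPOGS203987-TA rather than the full sequence.

```python
# Minimal sketch: translate short pieces of DPOGS203987-TA with the standard
# genetic code (an assumption; the record does not state which table was used).

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str, stop_char: str = "-") -> str:
    """Translate an in-frame nucleotide string; stops are written as stop_char."""
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        protein.append(stop_char if residue == "*" else residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of the transcript shown above.
first_30 = "ATGAAGATGGATCCCGGTGGAGAGTTGCAG"
last_15 = "AATAAAGGGATCTAA"

print(translate(first_30))  # start of the protein record
print(translate(last_15))   # end of the protein record, including the stop

# Length bookkeeping: 1188 residues plus one stop codon, three bases each.
print(3 * (1188 + 1))
```

The two translated fragments can be compared by eye with the start and end of the DPOGS203987-PA record. Running the same function over the full transcript, with the line break inside the FASTA record removed, would extend the check to the whole sequence.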