Monarch geneset OGS2.0

DPOGS211899
TranscriptDPOGS211899-TA2862 bp
ProteinDPOGS211899-PA953 aa
Genomic positionDPSCF300011 - 235501-238362
RNAseq coverage461x (Rank: top 27%)
Annotation
HeliconiusHMEL0166960.079.10% 
BombyxBGIBMGA001166-TA0.071.37% 
Drosophilapain-PA5e-10231.67% 
EBI UniRef50UniRef50_UPI0002062A141e-14436.68%UPI0002062A14 related cluster n=1 Tax=unknown RepID=UPI0002062A14
NCBI RefSeqXP_001950177.12e-14536.68%PREDICTED: similar to GA13986-PA [Acyrthosiphon pisum]
NCBI nr blastpgi|3287245824e-14436.68%PREDICTED: transient receptor potential cation channel protein painless-like isoform 1 [Acyrthosiphon pisum]
NCBI nr blastxgi|3287245821e-14335.74%PREDICTED: transient receptor potential cation channel protein painless-like isoform 1 [Acyrthosiphon pisum]
Group
Gene OntologyGO:00160204.2e-14membrane
GO:00550854.2e-14transmembrane transport
GO:00052164.2e-14ion channel activity
GO:00068114.2e-14ion transport
GO:00055153.8e-06protein binding
KEGG pathway 
InterPro domain[273-429] IPR0206835.4e-46Ankyrin repeat-containing domain
[586-775] IPR0058214.2e-14Ion transport
[392-418] IPR0021103.8e-06Ankyrin repeat
Orthology groupMCL12876 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211899-TA
ATGACGAAGGACAGCTACGAGATGAACAATCGTCTGTCTCGAGGCGGCTCGCTGTTCGGCGGCGATCCTCAAGTGCAACTGCGAACCGCGCTCCGTTCCAACGACTACGGAACATTCAAGAAGCTAGTCAGTTACGGTGCCGTCGACCTCGAACACGTGTACCCTTACCCCGACTACAAGACCTGCCTGGAAATCGCAGTTTCAGAACCAAATAAAATAGAATTCATCAAACTCTTACTCCAGCATGAGGTCCAAGTTAATAAAATCAACGAAACACATGGCGGAGCACCGATCCATTTCGCTGTTGAGAACGGTAACGCTGAGGCTTTAGAAGTACTCCTAGAAGATGACAGAATAGATCTGGACACGAGGTGGAAAGGCAACACCGCGCTACTGATGGCCATCAAGCAGATACAAGATTTAGACGAAGATCGTGAACACGACCTGGACATATATGAGGATATGGTAGAGAAGCTTCTCAAAGCCGGATGCAACGCCAATTCCCCAGATCTGAGAGGTATAACACCTGTATATTCGGCAGCTAAGCAAGGTCTAGAGAGGGTCGTGACCCTGATCTTAGACTATTCAAAAGATCCAATCGATTTAGACACTTATAAAGATATAAAAGGCAAGACAGCAAGGTACTATTTGAAAGAGGCATTTCCACATTTACTGCCAAAATTTGATTCCGCCGTAGAAAACATTGAGCCCAGTATCGACAAGGATTTACTGTTTTCATATTTACTAAGACATGAAGAAGATAATTTTATAAGAGACTTCACTAAGCTCAGTAGAAAGAACGAACATCGAGCCATGCTTGCGGCAGATAACGGCATGAATACGATGTTGCAACTGGCAGTTGACAAAGGACTGGAAAAGGTCGTGCAGACATTACTCAGTGCGGGTGCAGACGTGAACGCCACTTGTTCAGGAAACAACAGACGACCCATCGCTATAGCGTGCCACAACGGGTATCATAAGATCCTTAAAATGTTTATAGACAGCGATTCCAGTCTGTTCGACCCGGTCAACAGCGAATCCCTGGTGCAAATCACCGTCAAAGGAATGCGAAAAGCTGTCGAGAGTCCGAAGATCAATTACAAAGCTTGCTTGAACTTATTGTTGAAGCATCCCAAAGTTAACGTCGACATCAACCACCAGGACATGAAGGACAATACAGCGCTACACTACGCTGCCAGGAGCGGCGACAGCGAAACCGTTTTAGATTTACTGAGAAACGGAGCGTGCGTTGGTCTCAATAACGCGTTCGACGAGCCCCCGCTAGCCGACATCAACGCAAAAACACTAGAAACCTACTTGGACGAATGCATCACGACGAACAGCGAGCGGCCGAGTGACGAGAATTACGAGATACATATGAAATACAGCTTTCTGGTGTATCCGAATAATTCGTTGGAAAACGAACTGTGCCAGGTGCCGCTCATGGAAAAATCGAACAATAATAACGACAGCATCAGAAAATGTGACGCCATTTTAGCTCCGGAGACGGAAGCGCTCTTATACATGACAAGAAATGAGGAATTGCGGCCGCTCCTGAAACATCCAGTCATAACGAGTTTCCTTTACTTGAAATGGCAAAGGATAAGTTGCCTGTTCTACGCAAACATAACCTTCTACTCTCTCCTGTGGCTGTGTTTGATTCTATATATCATTTTGGGCTACGGAGTGGAAGGTCGGAAAAATAAATCCGTTGAAGCCCTGAATGTTCTGACGCACGTCGGAGCTGTCATTGGATTGATACTTCTGATTATGCGCGAGCTGTTCCAACTGCTTGTGTCACCGACACGATATTTGCAGAGCATAGAAAATTGGATGGAGATAGCTTTAATATTTGTCACCGCCTGGATTCTTGGTTATGACGCTGCAGAGGAATCGACTAAACAACAATTATCAGCTGTCGCCATATTACTATCATCAGCTGAACTTGTTCTGCTAATCGGTCAATTTCCAACTCTGTCAACTAACATCGTGATGCTGCGTACTGTCTCCTGGAATTTCTTTAAATTCCTTCTCTGGTATTGCATTCTTATAATAGCATTCGCTCTAAGTTTCTACACGCTGTTCAGGAAAGTTATATCGGAAGGAGATCAAAAGGCACCGAATCTGAATGAGCAAGCGTCGGAGGACGAGGACGAGGACTTCTTTGAAGACCCCGGCAGTTCCTTGTTCAAGACGATCGTCATGTTGACCGGTGAATTCGACGCCGGTTCTATAAAGTTCAGCACCTTCCCGGTGACCAGCCATCTTATATTCACAGTGTTCGTGTTCATGGTGCCCATAGTTTTGTTCAATTTGTTAAACGGTTTAGCCGTCAGCGACACACAAGAGATACGAGCGGACGCCGAGCTGGTGGGTCACATATCACGTGTTAAGCTCATTTCCTACTTTGAAAGCGTTCTCATCGGTAAAGCCTACACCAAGCCGAGGAGATGCTGGTCGTGGCTGCCGGCATACCTACAGAACGTACACTTCATCAAGCCCCAAATGTTATGCATCAAGCCGTTCGCCAAGCGAATATGTCTTTTTCCACACTTTCTGCCGAGGTTTCGTATCATCGTGATGCCGAATCAGAACAATCGAATCGAAATACCCAGACCTGAACCGCTCATGAAGGCCGGCGATGATTACGAGGACATCGAGGGCGGGAAGTGCTGTTTTGAAGGATGCCAATCCTTGAGATTGGAGCGAAGAATAGTCAAAAACGCTAAATTTATTATAAGCAGACGAACACAGGTTTCAGAATTCGACGAAATCAAATCAAGGCTCTCCGCGTACGAGAACAAAATCAGCGGTCTGGAGGTTGCTTTGAAGAAAATCCTTACACGAATGGACATCCCGTAA

Protein sequence:

>DPOGS211899-PA
MTKDSYEMNNRLSRGGSLFGGDPQVQLRTALRSNDYGTFKKLVSYGAVDLEHVYPYPDYKTCLEIAVSEPNKIEFIKLLLQHEVQVNKINETHGGAPIHFAVENGNAEALEVLLEDDRIDLDTRWKGNTALLMAIKQIQDLDEDREHDLDIYEDMVEKLLKAGCNANSPDLRGITPVYSAAKQGLERVVTLILDYSKDPIDLDTYKDIKGKTARYYLKEAFPHLLPKFDSAVENIEPSIDKDLLFSYLLRHEEDNFIRDFTKLSRKNEHRAMLAADNGMNTMLQLAVDKGLEKVVQTLLSAGADVNATCSGNNRRPIAIACHNGYHKILKMFIDSDSSLFDPVNSESLVQITVKGMRKAVESPKINYKACLNLLLKHPKVNVDINHQDMKDNTALHYAARSGDSETVLDLLRNGACVGLNNAFDEPPLADINAKTLETYLDECITTNSERPSDENYEIHMKYSFLVYPNNSLENELCQVPLMEKSNNNNDSIRKCDAILAPETEALLYMTRNEELRPLLKHPVITSFLYLKWQRISCLFYANITFYSLLWLCLILYIILGYGVEGRKNKSVEALNVLTHVGAVIGLILLIMRELFQLLVSPTRYLQSIENWMEIALIFVTAWILGYDAAEESTKQQLSAVAILLSSAELVLLIGQFPTLSTNIVMLRTVSWNFFKFLLWYCILIIAFALSFYTLFRKVISEGDQKAPNLNEQASEDEDEDFFEDPGSSLFKTIVMLTGEFDAGSIKFSTFPVTSHLIFTVFVFMVPIVLFNLLNGLAVSDTQEIRADAELVGHISRVKLISYFESVLIGKAYTKPRRCWSWLPAYLQNVHFIKPQMLCIKPFAKRICLFPHFLPRFRIIVMPNQNNRIEIPRPEPLMKAGDDYEDIEGGKCCFEGCQSLRLERRIVKNAKFIISRRTQVSEFDEIKSRLSAYENKISGLEVALKKILTRMDIP-