Monarch geneset OGS2.0

DPOGS204525
TranscriptDPOGS204525-TA3381 bp
ProteinDPOGS204525-PA1126 aa
Genomic positionDPSCF300297 - 384508-398040
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0087350.094.49% 
BombyxBGIBMGA004308-TA0.079.26% 
Drosophilaiav-PA0.070.32% 
EBI UniRef50UniRef50_E2BHT30.062.96%Transient receptor potential cation channel subfamily V member 6 n=15 Tax=Pancrustacea RepID=E2BHT3_HARSA
NCBI RefSeqXP_001121881.10.067.65%PREDICTED: similar to CG4536-PA [Apis mellifera]
NCBI nr blastpgi|2700142880.057.16%hypothetical protein TcasGA2_TC012368 [Tribolium castaneum]
NCBI nr blastxgi|2700142880.056.95%hypothetical protein TcasGA2_TC012368 [Tribolium castaneum]
Group
Gene OntologyGO:00160203.1e-09membrane
GO:00550853.1e-09transmembrane transport
GO:00052163.1e-09ion channel activity
GO:00068113.1e-09ion transport
KEGG pathwaymdo:1000110692e-52 
 K04975 (TRPV6)maps-> Salivary secretion
InterPro domain[123-318] IPR0206832.1e-19Ankyrin repeat-containing domain
[518-631] IPR0058213.1e-09Ion transport
Orthology groupMCL10729 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204525-TA
ATGGGTGTACCGTTAAGTAAGCTGTGTTCAGCAACGAGTGTCCCAGCCGTTGGCTCGGTTCTCGACCGTGTCATATCCCAGCCCAGCAGCGAAGATCACACTGTCTTATATAAGCTCGCTGATTATAAAAAAGGAGGTTTACTATTAGAAACCTATACTAAGGGCGGCGTGGTAGCCGCCGAGCGCCTCATCAGAGAGGAGTTCTCGGCATATATGTACGCTGGTGGACGGGGCCGGGTCATCAACAGAGCTGAATACCTCCGATGGAAGTTTCGAGACCAGGAACAGGTTGTTCTCCCAATAGAAGCGTCTCTCTCACCTCACGATCCTCTAGCGAAATGGGAAGATCACACAGCTTGCTGGCAGATGAGTTACCGCGGAGCACTGGGGGAGTCACTTCTTCATGTTCTCATTATATGCGACACCAAGATCCACACAAGATTGGCGAGAACTTTGGTCAAATGTTTTCCGAAATTGTCCCTGGATGTCGTCGAAGGTGAAGAATATTTGGGTGCTAGTTCGTTACATTTGGCCATCGCCTATAGTAATAATGAATTAGTCCAAGACTTGGTCGAAGCTGGCGCGGATGTTAGTCAGAAAGCTATCGGTAGTTTCTTTCTCCCGAGAGACCAACAAAAAAATCCACCAGCTCGTCAGACAAATTACGAAGGTTTGGCGTATTTAGGGGAGTACCCACTAGCGTGGGCTGCTTGTTGCGCTAACGAGGCCGTATATAACCTCCTTCTGGACTCTGGAGCTGATCCAGACGCCCAGGATTCATTTGGAAATATGATCCTGCATATGGTTGTAGTTTGTGATAAGTTGGATATGTTCGGCTATGCCCTTCGTCATCCAAAGGTACCAGCCAGTAACGGCAGACTAAATAAGGCTGGGTTCACACCGCTCACCCTTGCCTGCCAATTGGGGCGAGCCTCGGTGTTCAGGGAAATGTTGGAGCTTTCATCCAGGGAATTTTGGAGATATTCCAACATCACCTGTTCTGCATATCCATTGAATGCTCTAGACACTTTGCTACCTGATGGACGTACAAATTGGAATTCCGCATTATTCATCATACTTAATGGCACTAAACAAGAGCATCTTAATATGTTGGACGGGGGAATCATACAAAGGCTCTTAGAAGAGAAATGGAAAACATTTGCCAGAACAAAATTTTTAAAGCGTCTATTAATCCTCATGCTACATCTGATCTTGCTATCCATATCTGTGTATCTACGTCACAGCAGTCTGGAAGCGGATGTAGACCCCGACTGGGGTTTAGAGGTCAATGATGCAAGGTCCGGGATAAGATTGGCCTGCGAACTAGGAACTATCATAAGCACTCTTTGCTATATTATTCTGCAGCAAGGCGATGAGATCAAAAACCAAGGTGTTGTTGCGTACTTCAAGCAACTAATCCATGAACCAGCCAAATTTATATTCCTAGCTTCAAACATATTGTTACTGGCGTGTATTCCAGCAAGAATAAGCCAAAGAACTACTTTGGAAGAAGCAATACTAATATTCGTACTACCCGGTTCTTGGTTCCTTATGATGTTTTTTGCAGGAGCCGTGAAGTTGACTGGCCCGTTTGTTACAATGATATACAGCATGATCACTGGGGACATGTTCACATTCGGCATTATCTACTGCATAGTATTGTTCGGATTCTCGCAATCTTTTTACTTCCTGTATAAAGGGTTTCCGAACGTACATTCAACTCTGTACTCCAGCTATCCCAGCACCTGGATGGCTCTTTTTCAAATCACGTTAGGCGACTACAGTTACACAGATCTCGGTCTGACGATGTATCCTAATTTGTCGAAGACAGTGTTCACTGTTTTCATGGTGTTCGTCCCCATTCTGCTACTCAACATGCTGATAGCCATGATGGGTAACACATACGCACACGTTATAGAGCAATCAGAGAAGGAATGGGTAAAACAGTGGGCGAAGATTGTTGTGTCTCTGGAGCGTTCAGTATCCCAGGAAGATGCTCATCGTTACCTTCAGGAGTATTCAATAGGTCTGGGACCCTCAGACGACCCTCGTTATGAGCAACGAGGTGTTATGGTAATTAAGAGCAAGGCCAAAACTCGCGCCAAGCAAAGAAAGGGCGCGCTGTCTAATTGGAAGCGTGTGGGTAAAGTTACTATAGCAGAGCTGCGGCGCCGCGGCATTAGTGGAGAAGAATTGAGAAGACTCATGTGGGGGAGAGTCTCAATATCTACTCCAACCAAAGCACCCTTAAAATGTGTGCCTCCCCCGGAGTTAGTGACTTCAGAAATTCCAGGAAGTGGAGTTGGACCAGCGTTGTCATCAGCTCTTAACGTGATGGCGTACACACAAGATCTAGACCTCACCAACACCGGATCAGAACTGCATAAACAGATAACGCCCGATTTGCTCGTCAATGGCAAGACGCCTGTGACAGCACAGAATACAGCAACCACTCCCAAAGTACCACTTGTAAATCTACCAAATAAAGCGGGATTGGACGTTCAAAGTACGGGAAGTGATCAGAGCGTGCCAACTGCATTAACTAAGAATATGGACGTTTTGGGCATAAATATGTCAACACAGGATTTGCTAAAGAATCAAACTATAAACAATCCCGCTACAGCCAACGAAATAGTGTTCAAAGATTATCTCAGGGATATCATAAAGGCGGAGCAACTTGGTCTAGATAATATTGACATAAAGGCCCTCGCAGAGAAAGCGGCCAACCTCACCGACGTACCAGAGATAGACATTAACATATCTACGGCTGCAAGGTCTGCCCGTCGTGTGGTAGCCGGCGCTGTTTCTGGTCTGTTCGGTGTGACAGCAGAGACTCCGCGGGACGCAGGCTGGCGCAGAGAGAGACACGAACATACAGACAGTGACCCTGTCCCAGAGTGCGTCATACTGGGTCGTGCGGCGCGCGCTCGCCGTGCACGATCTGCTTCCCGGCGCGCGCCCCCTCCCCCACCTCACCTCTACGTTCCTGCGAGATCTATGTACTTGGTTGCATCGGAGAGTAGCGCTGTAGAAAGCGACGCACCCTTAGAAGATCAAGCTAGTTCAGGAAATAACTCATCTATGTACTTGGTTGCTTCGGAGAGTAGCGCTGTAGAGAGCGACGCACCCTTAGAAGATCAAGCTAGTTCAGGAAATAACTCAACAATAAGAGGCAGACAGGACGACAGTCTACGAGCTGTCAGACCACTATGCATTCAACAGGCTACTGCATTAATATCACCAGCGGCGCAAAACGAACATTTAATGTTTATTAAAGAATCTGGAAACGGAGCTGAAGTGGAAGGACCGGCGCCCGTTCGGAAAGTCAAAGTCACTACGAAAACGCGACCCAAAACCGCCAAAGCTAGGCGTAACAGGTAA

Protein sequence:

>DPOGS204525-PA
MGVPLSKLCSATSVPAVGSVLDRVISQPSSEDHTVLYKLADYKKGGLLLETYTKGGVVAAERLIREEFSAYMYAGGRGRVINRAEYLRWKFRDQEQVVLPIEASLSPHDPLAKWEDHTACWQMSYRGALGESLLHVLIICDTKIHTRLARTLVKCFPKLSLDVVEGEEYLGASSLHLAIAYSNNELVQDLVEAGADVSQKAIGSFFLPRDQQKNPPARQTNYEGLAYLGEYPLAWAACCANEAVYNLLLDSGADPDAQDSFGNMILHMVVVCDKLDMFGYALRHPKVPASNGRLNKAGFTPLTLACQLGRASVFREMLELSSREFWRYSNITCSAYPLNALDTLLPDGRTNWNSALFIILNGTKQEHLNMLDGGIIQRLLEEKWKTFARTKFLKRLLILMLHLILLSISVYLRHSSLEADVDPDWGLEVNDARSGIRLACELGTIISTLCYIILQQGDEIKNQGVVAYFKQLIHEPAKFIFLASNILLLACIPARISQRTTLEEAILIFVLPGSWFLMMFFAGAVKLTGPFVTMIYSMITGDMFTFGIIYCIVLFGFSQSFYFLYKGFPNVHSTLYSSYPSTWMALFQITLGDYSYTDLGLTMYPNLSKTVFTVFMVFVPILLLNMLIAMMGNTYAHVIEQSEKEWVKQWAKIVVSLERSVSQEDAHRYLQEYSIGLGPSDDPRYEQRGVMVIKSKAKTRAKQRKGALSNWKRVGKVTIAELRRRGISGEELRRLMWGRVSISTPTKAPLKCVPPPELVTSEIPGSGVGPALSSALNVMAYTQDLDLTNTGSELHKQITPDLLVNGKTPVTAQNTATTPKVPLVNLPNKAGLDVQSTGSDQSVPTALTKNMDVLGINMSTQDLLKNQTINNPATANEIVFKDYLRDIIKAEQLGLDNIDIKALAEKAANLTDVPEIDINISTAARSARRVVAGAVSGLFGVTAETPRDAGWRRERHEHTDSDPVPECVILGRAARARRARSASRRAPPPPPHLYVPARSMYLVASESSAVESDAPLEDQASSGNNSSMYLVASESSAVESDAPLEDQASSGNNSTIRGRQDDSLRAVRPLCIQQATALISPAAQNEHLMFIKESGNGAEVEGPAPVRKVKVTTKTRPKTAKARRNR-