Monarch geneset OGS2.0

DPOGS213917
Transcript        DPOGS213917-TA  3015 bp
Protein           DPOGS213917-PA  1004 aa
Genomic position  DPSCF300218 + 107420-138474
RNAseq coverage   423x (Rank: top 29%)
Annotation
Heliconius      HMEL006071        0.0  78.22%
Bombyx          BGIBMGA004625-TA  0.0  67.03%
Drosophila      ClC-a-PD          0.0  58.06%
EBI UniRef50    UniRef50_B0WA14   0.0  54.55%  Chloride channel protein 2 n=1 Tax=Culex quinquefasciatus RepID=B0WA14_CULQU
NCBI RefSeq     XP_001604692.1    0.0  57.45%  PREDICTED: similar to chloride channel protein 2 [Nasonia vitripennis]
NCBI nr blastp  gi|332017823      0.0  57.53%  Chloride channel protein 2 [Acromyrmex echinatior]
NCBI nr blastx  gi|347968770      0.0  56.89%  AGAP002891-PE [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0055085  4.1e-112  transmembrane transport
                 GO:0005216  4.1e-112  ion channel activity
                 GO:0016020  2e-70     membrane
                 GO:0006821  2e-70     chloride transport
                 GO:0005247  2e-70     voltage-gated chloride channel activity
KEGG pathway 
InterPro domain  [109-970]  IPR001807  0         Chloride channel, voltage gated
                 [168-642]  IPR014743  4.1e-112  Chloride channel, core
Orthology group  MCL10733  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS213917-TA
ATGAAGTTTAAAAAACAGACCCTAGTGTCGGGGGATACGTCGGAGGAGGAGATCGGGCACAGCTATTTGGGAACTCTGGGTCCGTCGGTACATATAGCGTCTATGGTGGCTACGTTGCTGTCTAAACTGGTAACCACCTTCCAAGGGATATATAGTAACGAATCTCGGACAAGCGAGATGTTGGCTGCTGCTTGTGCTGTAGGAGTTGCTTCCTGCTTTGCTGCCCCTGTGGGAGGTGTGTTGTTTTCCATCGAAGTAACGACGACTTACTTCGCGGTGAGGAACTACTGGCGAGGATTCTTCGCTGCCTGCTGCAGTGCTATTATGTATGGTCGCTATCAACGCGATTTAAGCGAAGCGGCCAGAGAAGAAGCAAGAAGACTAAGGAGATTGCGGAAGAAAAGACGAAAAGATGATAAGTTACGGCAAAAAGAATTGGAAGCCTCAGGAAAACATCGCCCCAGAGGAAGGTTCTTTAAAGTATTAGGCTACATTTGGCGGAATACCTTCGCACGTCTCGGCGAGGACTGGGTCTTCCTTGCGCTGCTGGGCATCATAATGGCTGTACTAAACTTTGCCATGGACAAGGGCATTGCAGTATGCAATAATGCTCGTATGTGGATGTATAAAGACCTGGCCACATCCACCTTCAGTCAGTATGTGGCGTGGGTCTCTCTGCCTGTTTGTCTAATCCTCTTCGCTGCCGGCTTCGTCCATATTGTCGCCGCTCAGAGTATTGGTTCTGGCATACCAGAGATGAAGACGATTCTAAGAGGAGTCCACCTCAAGGAATACTTGACGTTCAGAGCTATGGTCTCTAAAGTTATCGGTCTGACAGCCACCCTTGGATCTGGTTTACCATTGGGTAAAGAGGGTCCGTCGGTACATATAGCGTCTATGGTGGCTACGTTGCTGTCTAAACTGGTAACCACCTTCCAAGGGATATATAGTAACGAATCTCGGACAAGCGAGATGTTGGCTGCTGCTTGTGCTGTAGGAGTTGCTTCCTGCTTTGCTGCCCCTGTGGGAGGTGTGTTGTTTTCCATCGAAGTAACGACGACTTACTTCGCGGTGAGGAACTACTGGCGAGGATTCTTCGCTGCCTGCTGCAGTGCTATTGTACATAGAGATAAAAATGATTATAATAAAAGAACTATAGGTTTTAGAACTTCCAGCACTCAGATAATAGAAATATACTCGATTTTAATTAGCTACTTTATCGTCTGCGGGCTGATGGCAGCCCTGTGGGTGTTCCTCCACCGCCAGTACGTGCTCTTCATGAGAAACACCAAGGTCCTCAGTAACTTCCTACAGAAAAACCGCTTCATCTACCCTGGAGTGATGACCTTGGTTGTGATGTCCGTTCTGTTTCCTCCCGGGATTGGGAAGTATATGGCGGCTGACCTTGGAAACCAGGAACAGGTTTTGTCTCTGTTTTCGAACTTCACGTGGTCCGATGCGTTGACAGCGGAGCAGGCGGCGCTGGTCGATCACTGGCGGACCGAGGACGTTGGACACTTCGCTGTACTCGTTATTTACTTCTTCAGCATTTTCTTCCTCAGTATGGTTTCCTGCACACTTCCGGTTCCTGCTGGTATATTCGTGCCAGCGTTCAAGATGGGCGCCGCCCTAGGCCGGTTCACTGGAGAAGTGATGCACTACTTCTGCCCCCTTGGCGTCGCTTACGGTGGACACATACAGAAGATATTGCCTGGTGGTTACGCGACAGTAGGTGCCGCTGCGTTCACCGGGGCCGTAACTCACACCGTTTCTACGATCGTTATATGTATTGAGATGACAGGACAGGTGACTCACCTGCTGCCTATCATGGCGGCGGTGCTGTCCGCTAACGCGACAGCGGCTCTGCTGCAGCCCTCGTGCTTCGACAGCATCATCCTCATCAAGAAGCTGCCTTACCTGCCCGATCTGCTCTCGTCAGCGAGTCGTATGTACGATATATGCGTGGAAGACTTCATGGTGAGAGACGTCAAGTACATCTG
GAACAGGATGACCTTCCAGCAGTTGAAGGATTTGCTTAAAGAAAATAAGTCTATCAAGAGTTTCCCACTGGTATCCTCCCCATCCTCTCCCGTGCTCCTCGGGTCCATCCATCGCTGGGAGCTGGTGCGCCTGATCGAGCAGCGTGCTGGACGAGCCAGACGGCTCCAGGTTGCAGCTTTATGGAGACGAGAGGCTGAAGCTAGAAGAAGGCCCTCCCGCTTTGAAGTCACCGCAGCCTCGCTTACTGATACCAGCAAGGCAGGGCTCGTACCACCACCCGGCCAGCTGTTCCGCCCTAAGTCTATCTTGAAGAAGACCAATTCGTTCACTCTAACTCGTGGTCTAAGTTCACCTTCGACCCCCTCGACCCCGCAGCCTAATGTGTACACCACTGTAACCGGCGCGGAGACCAGGATCCGCGCGGCGTTCGAGGCTATTTTCAAGCGGTCAACTTTGCTGCCGGACGTGGAGGGTGGACTCGGAGACCACGGCCTGCCCAGAAGTCCGTCCATCAACAAGAAAGTACAATTGCCCCGCGAGCGTGTATGTGACATGTCCCCCGAGGATCAACGAGCCTGGGAGATGATGGAGATGTCCCGGGAGATAGACTTCGATAGAATGCTGACCATCGTCCGGCATAGAGATATGACGGCGGAGGAGTCCGATCATGACGATGAAGACGACTCGCTGTACGTGTGTCACATCGACCCAGCGCCCTTCCAACTGGTTGAGAGGACCTCGCTTCTTAAGGTCCACTCTCTCTTCTCTACTCTCGGCGTGAGTCGCGCATACGTCACCGCTATAGGAAGACTCATCGGTGTTGTAGCGCTTAAAGAGCTTCGGAAGGCCATAGAGGATGTGAATTCCGGTACATTGACCCCCACCAGCCACACCGCTGCGGCGACGTCGCTTCCGGTCCCTCGACCTCCGACTGTCCTGGTCCAACCGCCCCGGGAGCCCGCTCCTCCCTCCGACAAAGACACCGACAAACTGACAGTTGCGAGCGATAAATGA

Protein sequence:

>DPOGS213917-PA
MKFKKQTLVSGDTSEEEIGHSYLGTLGPSVHIASMVATLLSKLVTTFQGIYSNESRTSEMLAAACAVGVASCFAAPVGGVLFSIEVTTTYFAVRNYWRGFFAACCSAIMYGRYQRDLSEAAREEARRLRRLRKKRRKDDKLRQKELEASGKHRPRGRFFKVLGYIWRNTFARLGEDWVFLALLGIIMAVLNFAMDKGIAVCNNARMWMYKDLATSTFSQYVAWVSLPVCLILFAAGFVHIVAAQSIGSGIPEMKTILRGVHLKEYLTFRAMVSKVIGLTATLGSGLPLGKEGPSVHIASMVATLLSKLVTTFQGIYSNESRTSEMLAAACAVGVASCFAAPVGGVLFSIEVTTTYFAVRNYWRGFFAACCSAIVHRDKNDYNKRTIGFRTSSTQIIEIYSILISYFIVCGLMAALWVFLHRQYVLFMRNTKVLSNFLQKNRFIYPGVMTLVVMSVLFPPGIGKYMAADLGNQEQVLSLFSNFTWSDALTAEQAALVDHWRTEDVGHFAVLVIYFFSIFFLSMVSCTLPVPAGIFVPAFKMGAALGRFTGEVMHYFCPLGVAYGGHIQKILPGGYATVGAAAFTGAVTHTVSTIVICIEMTGQVTHLLPIMAAVLSANATAALLQPSCFDSIILIKKLPYLPDLLSSASRMYDICVEDFMVRDVKYIWNRMTFQQLKDLLKENKSIKSFPLVSSPSSPVLLGSIHRWELVRLIEQRAGRARRLQVAALWRREAEARRRPSRFEVTAASLTDTSKAGLVPPPGQLFRPKSILKKTNSFTLTRGLSSPSTPSTPQPNVYTTVTGAETRIRAAFEAIFKRSTLLPDVEGGLGDHGLPRSPSINKKVQLPRERVCDMSPEDQRAWEMMEMSREIDFDRMLTIVRHRDMTAEESDHDDEDDSLYVCHIDPAPFQLVERTSLLKVHSLFSTLGVSRAYVTAIGRLIGVVALKELRKAIEDVNSGTLTPTSHTAAATSLPVPRPPTVLVQPPREPAPPSDKDTDKLTVASDK-