Monarch geneset OGS2.0

DPOGS203340
Transcript        DPOGS203340-TA  1938 bp
Protein           DPOGS203340-PA  645 aa
Genomic position  DPSCF300003 - 144292-149593
RNAseq coverage   1002x (Rank: top 13%)
Annotation
Heliconius      HMEL013532        0.0  80.68%
Bombyx          BGIBMGA003862-TA  0.0  89.63%
Drosophila      ClC-c-PC          0.0  78.63%
EBI UniRef50    UniRef50_Q7Q6L5   0.0  87.46%  AGAP005777-PA n=15 Tax=Bilateria RepID=Q7Q6L5_ANOGA
NCBI RefSeq     XP_315792.4       0.0  87.46%  AGAP005777-PB [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|158294756      0.0  87.46%  AGAP005777-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|270003872      0.0  88.94%  hypothetical protein TcasGA2_TC003158 [Tribolium castaneum]
Group
Gene Ontology  GO:0055085  2.6e-176  transmembrane transport
               GO:0005216  2.6e-176  ion channel activity
               GO:0016020  6.5e-100  membrane
               GO:0006821  6.5e-100  chloride transport
               GO:0005247  6.5e-100  voltage-gated chloride channel activity
KEGG pathway 
InterPro domain  [1-624]    IPR001807  0         Chloride channel, voltage gated
                 [395-543]  IPR014743  2.6e-176  Chloride channel, core
Orthology group  MCL10774   Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203340-TA
ATGCGGCATCGATACATCGTGAAAAAGCGTCAGGATTCCATCTGGAACCTCATTAAAGGCGCTCACGACGCTTGGTCGGGATGGGTGTGCGTTCTCCTTGTGGGCGTGTGTACAGGGGTGGTGGCTGGTGTCATCGACATTGGCGCTTCTTGGATGACGGACCTCAAGTTCGGTATATGTCCCCAGGCGTTCTGGTTCAACAGGGAGCAGTGCTGCTGGTCCAACGATGAGATAACTTTTGATCACGGCAACTGCTCGCAGTGGATGACCTGGGCCCAGCTGTTCGGCGAGTCTAAGGAGGGTGTTGGCGCTTACATCATCAGCTACTTATTCTACATCGTTTGGGCGTTGTTGTTCGCGGCGCTCTCTGCATCTCTAGTGCGTATGTTCGCGCCGTATGCCTGCGGATCTGGTATACCGGAGATTAAGACGATTCTAAGCGGATTCATCATCAGGGGCTACCTCGGCAAGTGGACGCTTGTCATCAAAGTGGTTGGCCTTATCTTGTCCGTGTCATCAGGCTTGTCCCTCGGCAAAGAAGGACCAATGGTCCATATCGCCAGCTGCCTAGGTAATATCCTGTCGTACCTCTTCCCAAAATATGGACGGAATGAGGCAAAGAAACGTGAGATTCTTTCGGCAGCAGCGGCTGCTGGTGTGTCAGTGGCTTTCGGTGCTCCCATCGGTGGAGTTCTCTTTAGTCTTGAAGAGGTATCTTACTACTTTCCCCTGAAAACCCTCTGGCGTTCATTTTTCTGCGCGTTGATAGCCGCCTTCATCTTACGATCCATTAACCCCTTCGGCAACGAGCACTCGGTTCTCTTCTTCGTGGAGTACAACAAGCCCTGGATATTCTTCGAGTTGATACCTTTCGTCGGCTTGGGAATCATTGGCGGTTGCATCGCGACAATATTCATCAAGGCGAACATTTACTGGTGCCGCTACCGGAAGTACTCCAAGCTGGGTCAGTACCCAGTGACGGAGGTGCTGGTGGTGACCCTAGTGACCGCGATCATCGCCTATCCAAATCCATACACCAGAATGAACACCAGCCAGTTGATCTACTTGCTATTCAACCAGTGCGGCATATCTAACTCGGATCCTCTGTGTGACTATAATAGGAATTTCACCGACGTGAATAAGGCGATTGAGAAGGCTGCCGCTGGTCCTGGTGTGTACCAGGCTATCTGGCTGTTGATGTTGGCACTGGTGCTGAAGTTGGTGATGACCGTGTTCACCTTCGGCATTAAAGTACCCTGCGGGCTGTTCATACCCAGCCTCGCGCTCGGAGCCATCGCCGGCAGGATTGTGGGCATTGGTGTGGAACAGCTCGCGTATAAGTATCCGAAGATCTGGTTATTCTCTGGAGAATGTTCTACTGGCGATGACTGCATCACTCCAGGGTTGTACGCTATGGTTGGTGCTGCGGCTGTACTCGGCGGTGTTACGAGGATGACCGTGTCTCTGGTGGTGATAATGTTCGAGCTGACTGGCGGCGTGCGGTACATAGTGCCGCTAATGGCGGCGGCTATGGCGTCCAAGTGGGTGGGCGATGCGTTGGGGCGCCAGGGTATATACGACGCCCACATCGCGCTGAACGGATACCCGTTCCTGGACAGCAAGGACGAGTTCCAGCATACGTCACTCGCTGCTGACGTCATGCAACCCAAACGTAACGAGACCCTCTCCGTCATAACGCAAGACTCGATGACCGTTGATGATGTGGAGACGCTGCTGAAAGAGACAGAGCATAACGGATATCCGGTGGTCGTGTCCAAGGAGTCGCAATACCTCGTCGGATTCGTACTGAGACGGGACCTTAACCTGGCCATAGATGATAAAGAGACAATTTTATGCAGTATCGTACAATCAACAAGCAACTCCATCAATGGCATCCTCCGACACGGCCGTCCTGACATATCACACGCCCTCGTGTGA

Protein sequence:

>DPOGS203340-PA
MRHRYIVKKRQDSIWNLIKGAHDAWSGWVCVLLVGVCTGVVAGVIDIGASWMTDLKFGICPQAFWFNREQCCWSNDEITFDHGNCSQWMTWAQLFGESKEGVGAYIISYLFYIVWALLFAALSASLVRMFAPYACGSGIPEIKTILSGFIIRGYLGKWTLVIKVVGLILSVSSGLSLGKEGPMVHIASCLGNILSYLFPKYGRNEAKKREILSAAAAAGVSVAFGAPIGGVLFSLEEVSYYFPLKTLWRSFFCALIAAFILRSINPFGNEHSVLFFVEYNKPWIFFELIPFVGLGIIGGCIATIFIKANIYWCRYRKYSKLGQYPVTEVLVVTLVTAIIAYPNPYTRMNTSQLIYLLFNQCGISNSDPLCDYNRNFTDVNKAIEKAAAGPGVYQAIWLLMLALVLKLVMTVFTFGIKVPCGLFIPSLALGAIAGRIVGIGVEQLAYKYPKIWLFSGECSTGDDCITPGLYAMVGAAAVLGGVTRMTVSLVVIMFELTGGVRYIVPLMAAAMASKWVGDALGRQGIYDAHIALNGYPFLDSKDEFQHTSLAADVMQPKRNETLSVITQDSMTVDDVETLLKETEHNGYPVVVSKESQYLVGFVLRRDLNLAIDDKETILCSIVQSTSNSINGILRHGRPDISHALV-
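
The transcript and protein lengths listed in this record (1938 bp and 645 aa) can be checked against each other. The sketch below assumes the coding sequence spans the whole transcript and ends in a single stop codon, which fits the nucleotide sequence ending in TGA and the trailing "-" on the protein sequence. The function name `expected_protein_length` is a hypothetical helper introduced here for illustration, not part of any database tooling.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Return the number of residues encoded by a CDS of cds_bp bases.

    Assumption: the CDS is complete, in frame, and ends with exactly
    one stop codon, which is not counted as a residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_bp // 3 - 1


transcript_bp = 1938  # value from the Transcript row of this record
protein_aa = 645      # value from the Protein row of this record

# True when the two listed lengths agree under the assumption above.
print(expected_protein_length(transcript_bp) == protein_aa)
```

A mismatch here would suggest untranslated regions in the transcript, a partial gene model, or an extraction error in one of the two length fields.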