Monarch geneset OGS2.0

DPOGS210544
TranscriptDPOGS210544-TA1785 bp
ProteinDPOGS210544-PA594 aa
Genomic positionDPSCF300304 - 82651-91342
RNAseq coverage105x (Rank: top 60%)
Annotation
HeliconiusHMEL0064293e-14891.75% 
BombyxBGIBMGA013465-TA1e-14694.10% 
DrosophilaCG7708-PB0.081.71% 
EBI UniRef50UniRef50_Q9VE460.081.71%High-affinity choline transporter 1 n=15 Tax=Coelomata RepID=SC5A7_DROME
NCBI RefSeqXP_312589.20.083.48%AGAP002366-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|509112040.095.62%high-affinity choline transporter [Trichoplusia ni]
NCBI nr blastxgi|509112040.095.62%high-affinity choline transporter [Trichoplusia ni]
Group
Gene OntologyGO:00160202.7e-301membrane
GO:00068102.7e-301transport
GO:00550852.7e-301transmembrane transport
GO:00052152.7e-301transporter activity
KEGG pathway 
InterPro domain[1-582] IPR0017342.7e-301Sodium/solute symporter
Orthology groupMCL13362 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210544-TA
ATGATCAACATAGCCGGAGTGATCTCCATCATCCTCTTCTACGTTCTAATTCTGGCCGTGGGGATATGGGCGGGGCGCAAGAAGCCGCCAGGTAACGACTCCGAAGAAGAAGTGATGCTGGCAGGACGATCCATTGGATTGTTCGTTGGCATCTTCACTATGACCGCGACGTGGGTGGGTGGAGGCTACATCAATGGCACCGCTGAAGCCATCTATACTTCGGGGCTGGTGTGGTGTCAGGCCCCCTTCGGGTACGCGCTGTCACTCGTCTTTGGCGGCATCTTCTTCGCCAATCCGATGCGTAAACAGGGTTACGTGACCATGTTGGATCCTCTCCAGGACTCGTTCGGCAGTCGTATGGGAGGACTGTTGTTCCTGCCCGCGCTCTGTGGGGAGGTGTTCTGGGCTGCTGGAATCCTTGCAGCACTTGGCGCCACGCTGGCCGTCATCATAGACATGGACCACCGCACGTCCGTCATCTTCAGCGCCTGCGTCGCCGTCTTCTACACCCTGTTCGGAGGCCTGTACTCCGTCGCCTACACTGACGTCATCCAGCTGTTCTGTATCTTCATCGGCCTCTGGATGTGCATTCCTTTTGCCTGGGCCAACGAACACGTCAAACCTCTCAGCTCTATGGAAGTGGACTGGATAGGAAAGATCGATCCGGAGTATTACTGGTTCTATGTGGATTACGGGCTATTGCTCATTTTTGGAGGTATCCCGTGGCAGGTATACTTTCAGCGTGTTCTTAGCAGCAAGACCGCCGGCCGAGCTCAAGTGCTGTCGTATGTGGCAGCTATAGGTTGTATCTTCATGGCCATACCGCCCGTACTTATTGGAGCCATCGCTAAAGGCACCGCTTGGAACGAGACGGAGTATCACGGACCTTTGACGGCCGATGGTGAGTTGACTTCGGAACAAACGTCGATGATCCTGCCGCTCGTGCTGCAACACCTCACTCCAGACTTCGTGTCATTCTTCGGTCTGGGTGCTGTCTCAGCTGCGGTGATGTCGTCAGCTGACTCCAGCGTGCTCAGCGCTTCCTCTATGTTTGCGAGAAACGTGTACAAGCTGATATTCCGTCAAAACGCTTCTGAGATGGAGATCATTTGGGTGATGCGTGTGTCTATCCTGGTGGTGGGAGTCCTCTCCACCATCATGGCTCTCACCATCCCCTCTATATATGGCTTGTGGTCGATGTGCTCGGACCTGGTGTATGTCATCCTGTTTCCTCAACTGCTGATGGTGGTGCACTTCAAGCATCACTGTAACACGTACGGCTCGCTGGCGGCCTATATTGTTGCTTTAATGGTGCGACTCTCCGGTGGTGAGAAGCTGCTGGGCCTGCCCGCTCTCATCCACTACCCCGGGTGGGACGCCACCAACGAGGTTCAGTTGTTCCCCTTCAGGACGCTGGCCATGGTGCTGTCACTCTTCACACTGGCTTTCATCTCGTGGCTATCCGTGTTCTTGTTCAACTCCGGTTACTTGAGTCCTGAATCCGATTACTTTAACTGCGTCGTGAACATCCCCGAGGATATTAGAAGAGTGGACGAGCCCTCCGAGGCTGGGGAGCAGATGTCGGTCCTGGGAGGAGTTCCCGGTAGGCTATATGGAGCTGCTTCCACTCTCGTGGGTCCGGATGAGAAGTCGGGAAGGATCAACCCGGCCCTGGAACCCGACGATGACGATCTGGACGACCCCAATGATGGCCCCACGCCGCTCTTCGGATCCAGCAGTGCACCTCCACCGGTCCAGCCGTTCGGAAAACAGACAGCTTTCTGA

Protein sequence:

>DPOGS210544-PA
MINIAGVISIILFYVLILAVGIWAGRKKPPGNDSEEEVMLAGRSIGLFVGIFTMTATWVGGGYINGTAEAIYTSGLVWCQAPFGYALSLVFGGIFFANPMRKQGYVTMLDPLQDSFGSRMGGLLFLPALCGEVFWAAGILAALGATLAVIIDMDHRTSVIFSACVAVFYTLFGGLYSVAYTDVIQLFCIFIGLWMCIPFAWANEHVKPLSSMEVDWIGKIDPEYYWFYVDYGLLLIFGGIPWQVYFQRVLSSKTAGRAQVLSYVAAIGCIFMAIPPVLIGAIAKGTAWNETEYHGPLTADGELTSEQTSMILPLVLQHLTPDFVSFFGLGAVSAAVMSSADSSVLSASSMFARNVYKLIFRQNASEMEIIWVMRVSILVVGVLSTIMALTIPSIYGLWSMCSDLVYVILFPQLLMVVHFKHHCNTYGSLAAYIVALMVRLSGGEKLLGLPALIHYPGWDATNEVQLFPFRTLAMVLSLFTLAFISWLSVFLFNSGYLSPESDYFNCVVNIPEDIRRVDEPSEAGEQMSVLGGVPGRLYGAASTLVGPDEKSGRINPALEPDDDDLDDPNDGPTPLFGSSSAPPPVQPFGKQTAF-