Monarch geneset OGS2.0

DPOGS207548
TranscriptDPOGS207548-TA1995 bp
ProteinDPOGS207548-PA664 aa
Genomic positionDPSCF300072 - 1046489-1051535
RNAseq coverage452x (Rank: top 27%)
Annotation
HeliconiusHMEL0164060.081.93% 
BombyxBGIBMGA009965-TA3e-17370.62% 
DrosophilaCG1311-PA7e-17847.59% 
EBI UniRef50UniRef50_F4WSJ70.052.44%CTL-like protein 1 n=9 Tax=Endopterygota RepID=F4WSJ7_ACREC
NCBI RefSeqXP_970467.10.057.03%PREDICTED: similar to GA12051-PA [Tribolium castaneum]
NCBI nr blastpgi|910904000.057.03%PREDICTED: similar to GA12051-PA [Tribolium castaneum]
NCBI nr blastxgi|910904000.057.03%PREDICTED: similar to GA12051-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[19-662] IPR0076033e-246Choline transporter-like
Orthology groupMCL13913 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207548-TA
ATGGGTTGCACGCAAAGCGAGATTGCTCCTCAAAATGAAACGCCCAAAAAAGGCTGCACTGATGTATTATGGCTTGTTATATATATTATATTTTGGATTCTAATGATTATTATTGCGGCTATATCTTTTGTTTACGGAAATCCTCAGAGATTAATAAATGGATATGATTCATTCGGCAATACATGCGGTGTCAAGAATAACAAGAAACTTATAAATTTCCCATTAGCTGGTATCAGTACTGCTGATAAAAGTTATTTATTTTTTATGGATATTAAGGAACTTCGAAGATCACTTAAAATATGCGTTAAACAATGCCCCAATAAAAAGTTGGAATCATTCAATGATCTGCAAAACTTTTACCGGGAGACTGGATCAAATCTCTGCAGATACGATATCCATTTGAATAATGTAACAAGCACCAAAGATCTGCACAATTTTATAGGTCCTTGTCCAACATTGCCTGTATATGAGTCATTTCCACTTTTAAATAGATGTTTTCCTAAATCTGCTAAAGATTTAGCTCAGAAAGTTTTCAGTGATTTTTATGACTTACTAAATAGTTGGGATACCATCGAACAGATGTTATCAGACCTGTACTCATCTTGGAAGGAAATGATTATTTGTGTTATAATTGCATTTATTTGTTCATTGATCATGGTATCAATCCTGCATTTATTAGCATCTCTAGTGTCATGGATTTTCATGATAATAGTATCCATTGCTAGTATAGCTGGAACGGCCCTCCTGTGGTACACATATCATGAGTTGAAAACTAAGCAAAGGGATTTTTCTGATACAACTATTTACTTAGCTGAATCTCTTAAGAATGAAAAAGCCTTCCTATGGTATTCTATCATTGCAACCATAATAACTGTAATATTATTGTTACTAGTGTGGGTGATGAGATCACGTGTCTCTTTCTTGGCAGATTTGTTCAAGGAGACTGCACACTGTTTGGGTTCCATACCAGCTCTCTTCCTCCAGCCAATTATAACATTTTTCTTCCTTATATTATTCCTCACCTTTTGGTCACTAGTTGTAGTGTGCTTGGCAACTGCAAATTATCCAGGTATACCATACAAAACAAACTTCTTTATTAATGGAACTCCTTTACCTGACCATGTTGATAATGCAGCAGTTCAAGCACAAAATATCGACAAAAATGCTAATCTCAAGAGTTATGTTTTAGACCCAATCGAGTTTGATCCAATGTGGGTGAAGAGCATGTGGTGGATGTGTCTCATATGCCTGGTCTGGGGCAGTGAATTCATTTTGGGATGTCAGCAGATGACTATCGCTGGTGCCGTCTCACATTGGTACTTCAGAGGTCCAAACGCAAATCCATCGCCGGTACTATATTCCATAGGCAAACTTTTGAAATATCACCTAGGTTCTGTTGCTAAAGGATCTTTCTTGATAACTCTTTTCAAAATACCACGACTCATTCTCACCTACCTACATGCTAAGTTGTCTGCAAGAGCCGAAAAGGGTTCAGATTGTGCGAAATGCGGTCTCAAGTGTGGAATTTGCTGTTTCTACTGTCTGGAAAAATTCATACGCTATTTAAATCACAACGCGTACACTATCATAACGATCGATAGATGTCACTTTTGTAAAGCAGCGGGAAAGGCGTTCAGTACAATTGTGAACAACGCACTTCAAGTGGCGACGATCAATAGTGTTGGGGACTTCATACTTTTCCTTGGGAAGTGCATTGTGACGGCTCTAACAGGCATTGTTGGACTTCTCATGTTGAAGAGAAATCCAGATCTACATTTCTTTGCAGTTCCCACTCTCGTTATTTGCATCTTCTCATTCTTTATCGCGCATTGTATCCTGTCCTTGTATGAGATGGTTGTGGATTCTCTGTTCCTGTGTGTGTGCGAGGATCGTAACAGTAATAACGATGGACGTTGGCAACACTCGCGACTGGCTGAACTCGGTTTGAACAAGACTGACGCGACAGATGGCGCTGAAATGCAGGATCTCAAATGA

Protein sequence:

>DPOGS207548-PA
MGCTQSEIAPQNETPKKGCTDVLWLVIYIIFWILMIIIAAISFVYGNPQRLINGYDSFGNTCGVKNNKKLINFPLAGISTADKSYLFFMDIKELRRSLKICVKQCPNKKLESFNDLQNFYRETGSNLCRYDIHLNNVTSTKDLHNFIGPCPTLPVYESFPLLNRCFPKSAKDLAQKVFSDFYDLLNSWDTIEQMLSDLYSSWKEMIICVIIAFICSLIMVSILHLLASLVSWIFMIIVSIASIAGTALLWYTYHELKTKQRDFSDTTIYLAESLKNEKAFLWYSIIATIITVILLLLVWVMRSRVSFLADLFKETAHCLGSIPALFLQPIITFFFLILFLTFWSLVVVCLATANYPGIPYKTNFFINGTPLPDHVDNAAVQAQNIDKNANLKSYVLDPIEFDPMWVKSMWWMCLICLVWGSEFILGCQQMTIAGAVSHWYFRGPNANPSPVLYSIGKLLKYHLGSVAKGSFLITLFKIPRLILTYLHAKLSARAEKGSDCAKCGLKCGICCFYCLEKFIRYLNHNAYTIITIDRCHFCKAAGKAFSTIVNNALQVATINSVGDFILFLGKCIVTALTGIVGLLMLKRNPDLHFFAVPTLVICIFSFFIAHCILSLYEMVVDSLFLCVCEDRNSNNDGRWQHSRLAELGLNKTDATDGAEMQDLK-