Monarch geneset OGS2.0

DPOGS211253
Transcript: DPOGS211253-TA, 1968 bp
Protein: DPOGS211253-PA, 655 aa
Genomic position: DPSCF300425 - 17928-20993
RNAseq coverage: 65x (Rank: top 67%)
Annotation
Heliconius: HMEL012691, 0.0, 90.24%
Bombyx: BGIBMGA005373-TA, 0.0, 87.04%
Drosophila: CG13248-PA, 0.0, 54.19%
EBI UniRef50: UniRef50_Q6NNV9, 0.0, 54.04%, RH24371p n=16 Tax=Endopterygota RepID=Q6NNV9_DROME
NCBI RefSeq: XP_001845829.1, 0.0, 57.94%, cationic amino acid transporter 4 [Culex quinquefasciatus]
NCBI nr blastp: gi|170035950, 0.0, 57.94%, cationic amino acid transporter 4 [Culex quinquefasciatus]
NCBI nr blastx: gi|58386179, 0.0, 57.59%, AGAP010567-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology: GO:0016020, 4.6e-225, membrane
  GO:0003333, 4.6e-225, amino acid transmembrane transport
  GO:0015171, 4.6e-225, amino acid transmembrane transporter activity
  GO:0006810, 1.1e-35, transport
  GO:0055085, 1.1e-35, transmembrane transport
KEGG pathway 
InterPro domain: [7-655] IPR015606, 4.6e-225, Cationic amino acid transporter
  [7-655] IPR002293, 4.6e-225, Amino acid/polyamine transporter I
  [44-414] IPR004841, 1.1e-35, Amino acid permease domain
Orthology group: MCL12116, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211253-TA
ATGCCTGGAGCCAGACACAAAATCCTGGGGCACGTCCTTTCAGGATTTTGTCATAAGATGAATCGACGGAAGCCTATTTATGGAGACGTTTTGGACACGCCACTTAATCGTTGTCTAACAACTTTTGATATAACATTACTTGGGGTCGGTCATATGGTCGGAGCTGGGATCTATGTCTTAACAGGAACCGTTGCTCGTCATATGGCTGGACCAGCCACAGCCCTTAGTTTTCTACTAGCTGGAATTACATCTACACTAGCAGCGTTATGCTATGCCGAATTTGGCACAAGGATACCCAGGGCAGGAAGCGCTTACGCATACACCTATGTTAGCATTGGCGAGTTTTGGGCGTTTATAATTGGTTGGAATATTGTGCTTGAATACATGATAGGTGCTGCGTCAGTTGCTCGTGCTTGGTCAGGATATCTAGATGCTATACTAGACGGCGCAATAAGCAATGCCACGATTTCTGTCACAGGGGAGTTGCACGAAACCCTCCTAAGTAGATACCCAGACGTGCTAGCATTTCTTATATGCATAGTAGCGTCGTTAATTCTAGCCGTGGGCGTTAAAACTTCCGCGTATATAAACAATGGCCTTACCATTTTAAATCTTACCGTCATTTCTTTGGTTATATTTTTAGGTTTTTATTACGCAGATATTACCAATTGGTCTGAGAAGAACGGAGGTTTTATGCCATATGGATTTAGCGGAGTGTTAGCTGGTGCGGCTACCTGTTTTTATGCCTTTGTAGGTTTTGACAGTATATCTGCTTCTAGCGAGGAAGCCAAGGACCCATCGCGTTCGATACCTATTGCCACTATATTGTCCATGGTAATGGTGGGACTTGGTTATATACTGGTTGCGATAGCTTTAACGCTGATGGTGCCATATAGTACTATTAATCCGGAGGCAGCATTACCTGCAGCTTTGGGCGCGGTTCATGCTGATTGGGCCAAGTATGCTGTAGCTGTGGGCGCAGTATGCGGAATGACAACTACACTTTTAGGATCCTTGTTCTCGTTGCCACGATGCTTATATGCAATGTCAGCTGATGGACTCTTGTTCGGATTTCTCAGCGATGTTAATAATAAATCACAAATACCTATATCGAACCTCATCATAGCAGGCTTCTCTTCTGCATTCATTGCTCTTTTGTTCGATCTAGAGAAACTTGTGGAATTTATGTCCATTGGTACGCTATTGGCGTACACAATAGTTAGTGCAGCGGTTATCATATTGCGCTATCGACCTATTCCTCCTGAGGACAAAAGTTTCGGAGTGCCACAATTGGACTCGCCTATTGATAGAGAGGACAGCTCCGCTACTGGGACTCCGGCGACTGATGGCGGGTCATCCTCTTCAGAGATGTTTGAGGCGCTGACTGTGGGTCGGGTTCGTGCGCAATATGCTTGGCTTGAACCTCTGGCTGGAGGGCGAGCACCCGGGGCTGCGGTAACGTCTTGTGTGTACGTGTTCACTGTTGCGACCGCGGCTCTCTGCGCCCACAACCACTTCTTAGCACCGAACGCTGGACCCTGGGCTCTACTACCGGATTTCGTACTGACGTTTATAATTATTGCTTGTCTGTTTATTATCTGGGCTCATCAACAAAGCCCTCTGCGGCTGCCGTTCCGCGTCCCTTGGGTGCCGCTGCTGCCGGCGGCCAGCGTCATGCTCAATGTCGAGCTAATGGTCAACCTAAATGCACTAACATGGGCACGATTCACTATTTGGATGACATTTGGTTTACTCCTGTATTTCCTATACGGCATACATCACAGCAAACTCGGCGAAGGTGTAACAGGTTTGTTATCACGATCTGGTAGCGGTGGAGCTCACAGTTGGGGCGCAGTGGATAAAACCTCTTCGCTTGCAAGACGAGTAGGACGCTTGGGACGTGGTTGTAAGAGCGACGACCGCAAGCCAATAATAAGCGACGACGAACTAAACAGACGCGAACAGTGA

Protein sequence:

>DPOGS211253-PA
MPGARHKILGHVLSGFCHKMNRRKPIYGDVLDTPLNRCLTTFDITLLGVGHMVGAGIYVLTGTVARHMAGPATALSFLLAGITSTLAALCYAEFGTRIPRAGSAYAYTYVSIGEFWAFIIGWNIVLEYMIGAASVARAWSGYLDAILDGAISNATISVTGELHETLLSRYPDVLAFLICIVASLILAVGVKTSAYINNGLTILNLTVISLVIFLGFYYADITNWSEKNGGFMPYGFSGVLAGAATCFYAFVGFDSISASSEEAKDPSRSIPIATILSMVMVGLGYILVAIALTLMVPYSTINPEAALPAALGAVHADWAKYAVAVGAVCGMTTTLLGSLFSLPRCLYAMSADGLLFGFLSDVNNKSQIPISNLIIAGFSSAFIALLFDLEKLVEFMSIGTLLAYTIVSAAVIILRYRPIPPEDKSFGVPQLDSPIDREDSSATGTPATDGGSSSSEMFEALTVGRVRAQYAWLEPLAGGRAPGAAVTSCVYVFTVATAALCAHNHFLAPNAGPWALLPDFVLTFIIIACLFIIWAHQQSPLRLPFRVPWVPLLPAASVMLNVELMVNLNALTWARFTIWMTFGLLLYFLYGIHHSKLGEGVTGLLSRSGSGGAHSWGAVDKTSSLARRVGRLGRGCKSDDRKPIISDDELNRREQ-
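
Length consistency check:

The lengths reported in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the 1968 bp transcript is coding sequence only, including the terminal stop codon (the nucleotide sequence ends in TGA and the protein ends with "-"), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical and chosen for illustration.

```python
# Values copied from the record; nothing here is computed from the sequences.
TRANSCRIPT_BP = 1968
PROTEIN_AA = 655
GENOMIC_START, GENOMIC_END = 17928, 20993


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Coding-sequence length in bp for a protein of the given length.

    Each residue uses one codon (3 bp); a stop codon adds one more codon.
    """
    return 3 * (protein_aa + (1 if has_stop else 0))


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1


# Does the transcript length match 655 codons plus one stop codon?
cds_ok = expected_cds_length(PROTEIN_AA) == TRANSCRIPT_BP

# How much of the genomic interval is not accounted for by the transcript?
span = genomic_span(GENOMIC_START, GENOMIC_END)
extra_bp = span - TRANSCRIPT_BP

print(cds_ok, span, extra_bp)
```

Under these assumptions the transcript length agrees with the protein length, and the genomic interval is longer than the transcript, which would be consistent with the gene model containing introns. The record itself does not list the exon structure.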