Monarch geneset OGS2.0

DPOGS201174
TranscriptDPOGS201174-TA1767 bp
ProteinDPOGS201174-PA588 aa
Genomic positionDPSCF300262 - 167142-172955
RNAseq coverage873x (Rank: top 15%)
Annotation
HeliconiusHMEL0180115e-14374.02% 
BombyxBGIBMGA014265-TA0.078.00% 
Drosophilaslif-PC0.061.07% 
EBI UniRef50UniRef50_Q9VNP70.061.07%LD37241p n=26 Tax=Eumetazoa RepID=Q9VNP7_DROME
NCBI RefSeqXP_001958596.10.061.55%GF11006 [Drosophila ananassae]
NCBI nr blastpgi|1947525740.061.55%GF11006 [Drosophila ananassae]
NCBI nr blastxgi|3800193470.058.72%PREDICTED: LOW QUALITY PROTEIN: high affinity cationic amino acid transporter 1-like [Apis florea]
Group
Gene OntologyGO:00160203.1e-229membrane
GO:00033333.1e-229amino acid transmembrane transport
GO:00151713.1e-229amino acid transmembrane transporter activity
GO:00068101.5e-39transport
GO:00550851.5e-39transmembrane transport
KEGG pathway 
InterPro domain[3-569] IPR0156063.1e-229Cationic amino acid transporter
[3-569] IPR0022933.1e-229Amino acid/polyamine transporter I
[36-417] IPR0048411.5e-39Amino acid permease domain
Orthology groupMCL10195 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201174-TA
ATGGGTTGCGCCAGAATTTTCGCTGCGTTACGCCGCTGCAAACAGCTTGAGGAGGGCGAGATCACGACCCAGCTGTCTAGGTGCCTCGGTCTCCTGGACCTGACGGCCCTGGGAGTGGGCAGCACCCTCGGTCTGGGCGTCTATGTGCTGGCCGGTGCCGTCGCTAAAACCGTCGCAGGTCCAGCAGTTACCCTGAGCTTTCTTGTTGCAGCCATTGCCTCTGCGTTTGCAGGCTTATGCTACGCCGAATTCGCGTCTAGAGTTCCAAAAGCCGGCTCCGCGTACGTGTACAGCTATGTGAGCGTCGGTGAATTCATAGCGTTTACGATCGGCTGGAACCTCATCTTGGAGTATGTCATAGGAACGGCCAGCGTGGCGAAGGGCATGGCCAATTACATAGACAGTCTCTGTAACAACACTATGGCCGAGACCATGACGCGCATCGCCCCTATAAACGTCTCTTTCCTGGCGGATTATCCGGACATCTTCGCCTTCACACTCGTACTCCTCATAACCATTCTCCTCGGTATCGGAGTGAGCGAGTCAACTAAACTGAACAATGTATTCACCGCACTCAACATGGTCACCGTCATAATCGTCGTAGTTGCGGGTGCTATTAAAAGTGACCCAGCCAACTGGCGCATCGATGTCCAGGAAATACCAGAGGAGTATCGTGACAAGGCCGGCGGTGGAGGGTTCATGCCGTGGGGGATGGCGGGTGTCATGGCTGGAGCAGCGAAGTGCTTCTTCGGCTTCGTAGGCTTCGACTGCGTGGCGACAACCGGCGAAGAAGCCAAAAATCCCAAAAGAGACATTCCACTCTCCATAGTCTTGTCTCTCGTGATTATTTTCGTATCATACTTCAGTATAGCCACCGTCTTAACAATGATGTGGCCTTACTATTTACAGGACGCCGACGCTCCCTTCCCGCACGTGTTCGACGAGTCCGGGATGCCGGTCATCAAATGGATTGTAACGATAGGGGCTGTGTTTGCTCTCTGCACCAGTCTGTTGGGCGCGATGTTCCCTTTACCGAGGGTCCTGTACGCCATGGGCTCGGACGGTGTTCTGTTCAAGCCGCTAGCTGTTATTCACAAGAGAACCAAAACGCCGTTGCTGGCGACCGGGTTGAGTGGATTGTTTTCAGCTGTGATGGCGGCCATCTTTAATCTGAACCAGCTGATAGATATGATGTCTATCGGGACGTTACTGGCTTATACAATTGTTGCTACAAGCGTATTAATTCTAAGATATGAAGAGGAGCATCCGTTGACGGTCAAAGACAAATCATTAAGAGTAGGAGGACCGCGAGCGACGATCCTGCAGACTTGCAACTTGCTGGGTCTCAAACACCCCACGGAACTGTCCGCCACCATCGCCAAGTGTACCATCGGGATATTTTTCGTGTGTATGTTGGTGTGTTGTGTCGTGATGCAATGGTCTTCAAGTGTGGCGGTGTGGAGCGCGATAGGGGCCGTCCTGCTGGTACTGCTCGTCGTACTCTACCGGCAGCCTCGCGCTGACGTCACACAACTTAGCTTTAAGGTGCCGCTGGTGCCGCTAGTTCCTTACCTCAGCGTGTGTATGAACCTGTACCTCATGGCGCAACTCGACTACCAGACTTGGGTCCGCTTCATCTTATGGCTGGTTATAGGCTACGCTATTTACTTCTTCTATGGTCTTCGCAACAGTACCCTGAGAGAGAAGAAGCTGCCAGTCATGAACGGAAATGGAGTCCATGTGAAGCAAATCGTCACCAAATTTTGA

Protein sequence:

>DPOGS201174-PA
MGCARIFAALRRCKQLEEGEITTQLSRCLGLLDLTALGVGSTLGLGVYVLAGAVAKTVAGPAVTLSFLVAAIASAFAGLCYAEFASRVPKAGSAYVYSYVSVGEFIAFTIGWNLILEYVIGTASVAKGMANYIDSLCNNTMAETMTRIAPINVSFLADYPDIFAFTLVLLITILLGIGVSESTKLNNVFTALNMVTVIIVVVAGAIKSDPANWRIDVQEIPEEYRDKAGGGGFMPWGMAGVMAGAAKCFFGFVGFDCVATTGEEAKNPKRDIPLSIVLSLVIIFVSYFSIATVLTMMWPYYLQDADAPFPHVFDESGMPVIKWIVTIGAVFALCTSLLGAMFPLPRVLYAMGSDGVLFKPLAVIHKRTKTPLLATGLSGLFSAVMAAIFNLNQLIDMMSIGTLLAYTIVATSVLILRYEEEHPLTVKDKSLRVGGPRATILQTCNLLGLKHPTELSATIAKCTIGIFFVCMLVCCVVMQWSSSVAVWSAIGAVLLVLLVVLYRQPRADVTQLSFKVPLVPLVPYLSVCMNLYLMAQLDYQTWVRFILWLVIGYAIYFFYGLRNSTLREKKLPVMNGNGVHVKQIVTKF-