Monarch geneset OGS2.0

DPOGS202985
TranscriptDPOGS202985-TA1515 bp
ProteinDPOGS202985-PA504 aa
Genomic positionDPSCF300068 - 444298-447848
RNAseq coverage1139x (Rank: top 11%)
Annotation
HeliconiusHMEL0110410.090.68% 
BombyxBGIBMGA012328-TA0.083.06% 
DrosophilaCG6327-PB9e-15456.72% 
EBI UniRef50UniRef50_F4WE430.068.20%Proton-coupled amino acid transporter 4 n=3 Tax=Formicidae RepID=F4WE43_ACREC
NCBI RefSeqXP_394217.10.069.84%PREDICTED: similar to CG6327-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3320273980.068.20%Proton-coupled amino acid transporter 4 [Acromyrmex echinatior]
NCBI nr blastxgi|3320273980.069.33%Proton-coupled amino acid transporter 4 [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain[89-497] IPR0130574.8e-77Amino acid transporter, transmembrane
Orthology groupMCL15676 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202985-TA
ATGGTCGAACCGGCACCGGTCCAGCGAGCCCCTAAGAGCTGCAAACCACAACTGCGGCCGATGATAGCCGAATATGATCCTAAGAGAAAAGGAGTCAAAAATGACTTGTCAGATGTCGTTATGGTCAAGTACAAAGTTGATCCAAATGAGATTCCGGTGGAACAACAGGCGGGCTCGACCCTCCCTCTCATGGAAATACCCGGTCGGGATATTGAGGCAGACGAGGACTATAACCCCTTCGACCACAGAAAACTGGCTCACCCCACTTCGGACATGGATACTCTTATCCATTTGCTCAAGGGATCGTTGGGTAGTGGTATATTGGCCATGCCCATGGCCTTCATGAACGCTGGTCTCTACTTCGGTCTAGTGGCGACCTTCCTTATCGGTGGCATTTGCACGTACTGCGTCCACGTGTTGGTCAAAACGTCTCACGAACTCTGTAAGCGGATACAAAAACCCTCCCTAGGATTCGCAGAGACAGCGGAGGCCGCATTCTTATCAGGACCACCAGCTGTTCACAAATTTTCGAGACTCGCCAAAGCTATAATAAATTGGTTCCTCGTAGTGGACTTATTGGGCTGCTGCTGCGTCTACATCGTATTTATTTCAACAAACGTTAAGCAAGTAGTAGACTTCTATGCCGAGAAAAGTGACTGGCTCCACCACGACTTGGATTTGCGTATATACATGGTTGCGTTGCTTCCGTTCCTCATTGCAATGAACCTCATTAGGAATCTCAAGTATCTGGCACCGTTCTCCATGATTGCGAATCTTTTAGTCGGAACCGGGATGGGCATCACCTTCTATTACCTATACCAAGATATTCCAAGCATCAGTGACCGTAAACCCTTCGCTGGATTCGAGCGCCTTCCTACTTTCTTTGGTACCGCTATCTTCGCTCTTGAGGGTATTGGCGTCGTGATGCCATTAGAGAATAACATGAAAACACCTACTCACTTCATCGGCTGCCCCGGAGTCCTGAATACTGGCATGTTCTTCGTAGTCTCCCTTTATGCCATTGTAGGATTCTCTGGATACCTCAAATACGGTGATGCAACTGGAGCTAGTATTACATTGAACTTGCCCCAAGACGAAGTGTTGGGCCAGAGTGTGAAATTGATGATCGCTGTTGCTATCTTCTTCACGTACAGTCTTCAGTTTTACGTTCCCATGGAAATCATCTGGAAGAACGTTCGTCACATGTTCGGCTCCAAGAAGAATATTGCTGAGTACAGCATCAGGATCGGCATCGTCATCATGACCCTCTGCACTGCCATTGCCATCCCAAACCTGGGCCCGTTCATCTCGTTGGTTGGCGCCGTCTGCCTCTCCTTCTTGGGTCTCATTTTTCCAGCAGTCATCGAAACCGTCACCTTTTGGGACCGACCCAACGGTCTCGGTCGTTTCAACTGGGTCCTTTGGAAGAACCTTTTCTTAATCTGCTTCGGTATCCTCGGCTTCCTCACAGGTTCCTATGTTAGTATTTTAGATATAATCAAGGGTGAAGACTAA

Protein sequence:

>DPOGS202985-PA
MVEPAPVQRAPKSCKPQLRPMIAEYDPKRKGVKNDLSDVVMVKYKVDPNEIPVEQQAGSTLPLMEIPGRDIEADEDYNPFDHRKLAHPTSDMDTLIHLLKGSLGSGILAMPMAFMNAGLYFGLVATFLIGGICTYCVHVLVKTSHELCKRIQKPSLGFAETAEAAFLSGPPAVHKFSRLAKAIINWFLVVDLLGCCCVYIVFISTNVKQVVDFYAEKSDWLHHDLDLRIYMVALLPFLIAMNLIRNLKYLAPFSMIANLLVGTGMGITFYYLYQDIPSISDRKPFAGFERLPTFFGTAIFALEGIGVVMPLENNMKTPTHFIGCPGVLNTGMFFVVSLYAIVGFSGYLKYGDATGASITLNLPQDEVLGQSVKLMIAVAIFFTYSLQFYVPMEIIWKNVRHMFGSKKNIAEYSIRIGIVIMTLCTAIAIPNLGPFISLVGAVCLSFLGLIFPAVIETVTFWDRPNGLGRFNWVLWKNLFLICFGILGFLTGSYVSILDIIKGED-