Monarch geneset OGS2.0

DPOGS201196
TranscriptDPOGS201196-TA3024 bp
ProteinDPOGS201196-PA1007 aa
Genomic positionDPSCF300262 + 206478-214139
RNAseq coverage148x (Rank: top 54%)
Annotation
HeliconiusHMEL0047400.064.13% 
BombyxBGIBMGA014240-TA0.053.57% 
Drosophilaslif-PC2e-14446.23% 
EBI UniRef50UniRef50_UPI000224675F2e-14745.81%UPI000224675F related cluster n=4 Tax=unknown RepID=UPI000224675F
NCBI RefSeqXP_002095861.17e-14445.74%GE22647 [Drosophila yakuba]
NCBI nr blastpgi|3454813986e-14745.81%PREDICTED: hypothetical protein LOC100119236 [Nasonia vitripennis]
NCBI nr blastxgi|2700129321e-14435.09%hypothetical protein TcasGA2_TC001941 [Tribolium castaneum]
Group
Gene OntologyGO:00160206.3e-195membrane
GO:00033336.3e-195amino acid transmembrane transport
GO:00151716.3e-195amino acid transmembrane transporter activity
GO:00068103.1e-29transport
GO:00550853.1e-29transmembrane transport
KEGG pathway 
InterPro domain[12-545] IPR0156066.3e-195Cationic amino acid transporter
[12-545] IPR0022936.3e-195Amino acid/polyamine transporter I
[40-424] IPR0048413.1e-29Amino acid permease domain
Orthology groupMCL26183 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201196-TA
ATGGCAGTCAAGTGGCGTGATCTGCTCTGCACGCTGAAACGAAGGCGAGTCTTCGAACCCGATCAGCTCGACGTGGGCAATCTGAGACGATGTCTGTCAGTATGGGATCTGACGGCGCTGGGTGTGGGGAGCTCGCTGGGAGTGGGTGTGTACGTGCTGGTGGGATCCGTTGCGCTTCACCTGGCCGGACCCTCCATAGTGCTTTCATTCCTCATTGCAGCTGTAGCAGCTGTTGTGGCTGCAATGTGTTACGCGGAGCTGGGGTCGCGGGTTCCGAAAGCTGGGTCTGCATACATATACACATATGTGACTGTTGGAGAGATAGTGGCCTTCATTATTGGTTGGAACATGATCCTGGAACTGGTCTTTGGGACGGCGAGTGTCGCCCGGGGTCTCAGCATGTACGTGGACTCGGTCACCAACAAGACCATGTCCTCGTGGATGGAGTCGCTCGTTCCGATTCACTCTGATTACTTCTCCTCCTACTTTGATATCTTTTCATTCTTCGTGGTCGTATTTTTGGGGGTGCTTCTGGCTGTTGGTGTGCGCGAGTCAACATTTGTGAACAATTTGTTGACAGCTGTCAATATACTCGTCATAGTGTTCATCATATGCGCAGGCGCATTCAAAGCGGACTTCAGTAACTGGAATATCCCACCGAGCGAGGTGCCGTCAGGTCGTGGTGTGGGAGGATTCTTCCCTTACGGCATCTGGGGCACGTTAAGAGGGGCCGCCCTATGCTTTTATGGATTCGTCGGATTTGATAGCATTAGCTCTACCGGAGAGGAGGTGCGAGATCCTCGTCGTGCTCTACCAATATCAATAATGGCCACGCAAGTGATAGTGTTCCTGGCCTACGCGGGCGTGTCCATCGTCGTCACTATGATGATGCCTTATTACCTACAGGAGACGGTAGCCTCAGTGGCAACGTCATTCGCCTACGTGGGCTGGGACTGGGCGCGGTGGTTCGTCACAATCGGCGCAGTGATCGGCATATCAGCCAGTCTGTACGGCTCGATGTTTCCTCTCCCTCGTCTGCTGTACTCGATGGCATCAGACGGTCTTCTGTTCCACTGGCTATCCAAAGTGACTTCAAAGAGGAAATCACCAACCGTCGCCACTATACTGTCCACGGTGGTTATTGCTATACTAGCGGCTTTATTGGAGTTGAATGATTTGATTTTGATGATGTGTGTTGGGACACTTTTGTCTTACACGATCGTCGCTTCCTGCGTTATTCTATTGAGGTACCGTTCGAACACATCATCGCACGAATTGACGGCTAAGCATGTTGTTGGTAACGGTCGTAGGTTACCGACGCGGACGACATCAACTATTGTTGTTACATTACTGCTATTGTTCATTTGTGCGTGTGTGTCGTCTGCGGTGGTCGTGACCCACGTGTCGGAGCCTCTGGTCGGCGCCTGTACCATTCACGCGGCGGGTCTACTACTGATTGTTGCCATGGCGCTGCAGCCGCAGAATGACGAGGACTTAACCTTTCAGTGTCCCCTCGTCCCCATGATTCCTTGCATCAGTATATACGTCAACATACATCTGATGATTCTCATAAAACTTCAGACCTGGATACGTGTTTTGTGCTGGATAGCAATCGGTATACCAGTTTATATATTCTGTGTCTGTTGTTATAAGCAAAAAGTGGAAAATTACACAGAAAACAATACAAACTCGACACAAGCGAACGGTAAGGCACCGGTTCAGATATTCGTTGTGTCACCCACACCTCCCGCTACTATCGGACGTAGTAACCAAGGAGGCGGGTACGATCACGATGACGACATCCGAGTGGACTTGAAACAAAATATTATACACATTAAAGAACCGGTTATAATGGAAGAGATAATAGTTCAACATGCTTACATCGGGGACAATGAAGAAAAGGAAGCCAAGATTATAGATTTGCTGGACCAGGTGTTACAGGCTGAAGAGGATTCTTATGAAGAGATAATAAGTTTGAAAGAACATAAGGAAGAACCTCAAGAAGAAGTTAAAACTCCAGAAATTAAAACGCACAGAAAATCGTTGAGTGAATTATCTGACGCTGGGTCAGATGTATCTAACCAAGTGCTATCTAAATACGATGTGATTGCTCAAGTTCATAGAGAAGATTTGCAAAAACTCACAGAAGAAGAAGAAGAGAAAAGCGATAAAGAAACCGAAAGTAGCGATAATGAAATTCTCGAGAACGAAGATCTTAATTGCAATGGTAGCGATACGACCTCTAGGACAGACGAATCTGGTTATTCGGATACACTCGATAAAACAGTTCTCGGTGAATCCACAGAAGATTTAAAAGAAGCAGAGGAAATACCAAACATACCAGTCCCACCGCCTTTAGACGAAAACTTCTTCGCTAGTCCAACATTTAAAAAATCATACACGATATCTGTTAGACCACCGAAAAGACAAGTCGAACAAGAAGAAAACAAGCCCAGGGAGAGTGTTGATTCGAATCATTCCTACGACGACGCACCGATGGTATTTGGAAGCGATAAGCAAATGAATTTCATGTCCAAATTAGAAAACATCTTTCAAAGTAAAATGGCGAATGACAATGAAGAAGACCCCCATAGAAAAAGATCAAATTCTACCGGTAATGTAGTGGATAGTCCGAATTCACAGCCAGTCCGACCGGTAATGTTGCTTGATTTGAAAAAAGAAATTGTTTCAAGAGATGTGGCTCAGAATTTACGTCATGTTGATGCAGAAGAGAAAAAAGAATCTTCTGAGGACGAGGAAGAAGATGTCAGTATGAGCCGCGAAGATCTAAAATCCAAGTTAGAAAATATATTTGCAGTGGGTGGACCGCAATTAATTAAAAGTAGATTAATGAAATCCAATCCTCCTACACCTGAAGAGGCTTACCATACCGACGCATCAAGTACAGAGAGTATACCTAAAGTGCCTAAAAAGGAAAAGAATGACACATTAAAAAGACAAAAGACCAAATTTGGGGAAGTTCTGAATTCTTTACGGATGATGAACAATGATGATAAAGTCTAG

Protein sequence:

>DPOGS201196-PA
MAVKWRDLLCTLKRRRVFEPDQLDVGNLRRCLSVWDLTALGVGSSLGVGVYVLVGSVALHLAGPSIVLSFLIAAVAAVVAAMCYAELGSRVPKAGSAYIYTYVTVGEIVAFIIGWNMILELVFGTASVARGLSMYVDSVTNKTMSSWMESLVPIHSDYFSSYFDIFSFFVVVFLGVLLAVGVRESTFVNNLLTAVNILVIVFIICAGAFKADFSNWNIPPSEVPSGRGVGGFFPYGIWGTLRGAALCFYGFVGFDSISSTGEEVRDPRRALPISIMATQVIVFLAYAGVSIVVTMMMPYYLQETVASVATSFAYVGWDWARWFVTIGAVIGISASLYGSMFPLPRLLYSMASDGLLFHWLSKVTSKRKSPTVATILSTVVIAILAALLELNDLILMMCVGTLLSYTIVASCVILLRYRSNTSSHELTAKHVVGNGRRLPTRTTSTIVVTLLLLFICACVSSAVVVTHVSEPLVGACTIHAAGLLLIVAMALQPQNDEDLTFQCPLVPMIPCISIYVNIHLMILIKLQTWIRVLCWIAIGIPVYIFCVCCYKQKVENYTENNTNSTQANGKAPVQIFVVSPTPPATIGRSNQGGGYDHDDDIRVDLKQNIIHIKEPVIMEEIIVQHAYIGDNEEKEAKIIDLLDQVLQAEEDSYEEIISLKEHKEEPQEEVKTPEIKTHRKSLSELSDAGSDVSNQVLSKYDVIAQVHREDLQKLTEEEEEKSDKETESSDNEILENEDLNCNGSDTTSRTDESGYSDTLDKTVLGESTEDLKEAEEIPNIPVPPPLDENFFASPTFKKSYTISVRPPKRQVEQEENKPRESVDSNHSYDDAPMVFGSDKQMNFMSKLENIFQSKMANDNEEDPHRKRSNSTGNVVDSPNSQPVRPVMLLDLKKEIVSRDVAQNLRHVDAEEKKESSEDEEEDVSMSREDLKSKLENIFAVGGPQLIKSRLMKSNPPTPEEAYHTDASSTESIPKVPKKEKNDTLKRQKTKFGEVLNSLRMMNNDDKV-