Monarch geneset OGS2.0

DPOGS203368
TranscriptDPOGS203368-TA1884 bp
ProteinDPOGS203368-PA627 aa
Genomic positionDPSCF300003 + 204441-209820
RNAseq coverage364x (Rank: top 33%)
Annotation
HeliconiusHMEL0135290.072.87% 
BombyxBGIBMGA003894-TA8e-16866.06% 
DrosophilaCG16700-PA3e-11548.49% 
EBI UniRef50UniRef50_E2B4G04e-13453.26%Proton-coupled amino acid transporter 4 n=11 Tax=Endopterygota RepID=E2B4G0_HARSA
NCBI RefSeqXP_001603709.15e-13152.50%PREDICTED: similar to GA14090-PA [Nasonia vitripennis]
NCBI nr blastpgi|3072143431e-13353.26%Proton-coupled amino acid transporter 4 [Harpegnathos saltator]
NCBI nr blastxgi|3072143431e-13353.26%Proton-coupled amino acid transporter 4 [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain[219-620] IPR0130571.1e-78Amino acid transporter, transmembrane
Orthology groupMCL16264 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203368-TA
ATGAAAGCCATTTATATCTTGAAATCTGTTATCTTAAATCATTTGAAGAAACTTCCAGTGGTGATTTTTCTCCTGCTATGTTCGTTGTTCCTCGTTAACTCGCAGGATGTGTCCGAAACCCGTGATCTTGCCGACGATGGTGAGGACTACGATGAACTTGGCCGCAAACATCGCAAGAAGAAAAAACATCAAAATAATCCTTGTTACGGAGGTTACGGGCGGACTTTTGGAGACAAATTCGGACATCCTGATCAAACGTATATTTCTGCTCCAGTCACTAACTACTTCTTTGGTTGCGGAGGTCCTGTGGTACCACAGCATCAAGGACATGGCCATGGCCACGGTGGCCACGGCCACGGCGGACACGGTCACGGTGGCCACGGTCATATAGGGTACGGCCAGCACGGTGGATTCAACGGACCCCTTAATTATGGATTCAATCAAGGATTCAATCAACCCTATCCATTTGCTCAACACCAATATCCGTTCAATGGTGGCTATAACGTGGCCTCCGCTGCAGCATCAGGTTTCGGCAATCTCGTTACGACGACTGATAGTTACGGGGAGTACCAGTCTCGTGAGCAGATCATTAACTTGGATAGTAAAGTAAGAAGCGAGGATGAGCCCAACCCTGAACATCCCCACGTCACACATCCGACATCCTATATGGACACGATGCTACATCTGTTCAGAGGTAACATCGGTTCTGGTTTGCTGGCTATGGGCGACGCTTTTAAGAACGGTGGCATTATTTTTTCACCTATCATGACAGCGATACTCGGTGTTATATGCGTGCACGCTCAACATTTACTGCTGAACTGTTCAGAGGAGATGTATAGGAAGACGAAAAGGGACAAGCCTCCGGGTTTCGCGGACACAGTGTCGCTGGTGTTCGAGTACGGACCGGTGACACTGAGGCCGCTCGCACCGACCATGAAAATCCTGGTGAACACATTTCTGTGCATCACACAACTGGGCTTCTGTTGCGTCTACATAGTATTCATAGCGAACAACGTTAAAATGATATGTGACCAGCGAGGTCTGCATATAGATTTAACAATACACATGATATTCGTCATAATACCTATATTACTGATCTGTATGGTGCGAAACCTGAAATATTTAACTCCGTTCTCGACTTTGGCCAACGTGATGATGGCGCTAGGAGTTGGCGCTGTTTTGTACGAAGCTGTACAGGACATACCGCCGGTGGAGAGCAGGGACTACATAGCTCACTGGAGCCAGCTACCCTTGTACTTCGGTACAGCTATTTACGCTTTCGAAGGCATAGGATTGGTGCTGCCATTGAAAAACGAAATGCGGAAACCAGAGCTATTCCAGAAGCCTTTGGGCGTTTTAAACTTGGGTATGGTGATTGTTGCTGGTATATTCGTAACGGTCGGCTTCTTCGGCTACCTCAAGTGGGGTGATGAGGTCGCTGGCAGTGTCACCCTCAACCTGAACCCCGCGAATGTTTTAAGCACAACGGTCCAGGTGTTGATAACTCTGGCTATGCTGTTGACTTATCCCCTCCAGATGTACGTCCCGGTCGCGATAATGTGGCCGCCTCTCAAGAAGAAGTACGGGAAGTCGTCACCCGTGGCCAAGGAGCTGGGATTCAGAGTTCTGCTAGTACTACTCACCTTCGTTCTGGCGGAGTCGATTCCACAGTTGGGACTGTTCATATCCCTGGTGGGAGCCATCAGTAGCACCACCCTCGCCCTCATGTTCCCACCGATCATACAGCTCGTGTCCCACTACCAGAACAACAACGGCCTGACAGTCTTTATCACAGTGAAGAATCTCTTGATCATATCGCTGGGGCTGTTTATCTTTGTCACAGGAACATATCAAAGTATAGCCTCTATCGTACAAGCATTTTAG

Protein sequence:

>DPOGS203368-PA
MKAIYILKSVILNHLKKLPVVIFLLLCSLFLVNSQDVSETRDLADDGEDYDELGRKHRKKKKHQNNPCYGGYGRTFGDKFGHPDQTYISAPVTNYFFGCGGPVVPQHQGHGHGHGGHGHGGHGHGGHGHIGYGQHGGFNGPLNYGFNQGFNQPYPFAQHQYPFNGGYNVASAAASGFGNLVTTTDSYGEYQSREQIINLDSKVRSEDEPNPEHPHVTHPTSYMDTMLHLFRGNIGSGLLAMGDAFKNGGIIFSPIMTAILGVICVHAQHLLLNCSEEMYRKTKRDKPPGFADTVSLVFEYGPVTLRPLAPTMKILVNTFLCITQLGFCCVYIVFIANNVKMICDQRGLHIDLTIHMIFVIIPILLICMVRNLKYLTPFSTLANVMMALGVGAVLYEAVQDIPPVESRDYIAHWSQLPLYFGTAIYAFEGIGLVLPLKNEMRKPELFQKPLGVLNLGMVIVAGIFVTVGFFGYLKWGDEVAGSVTLNLNPANVLSTTVQVLITLAMLLTYPLQMYVPVAIMWPPLKKKYGKSSPVAKELGFRVLLVLLTFVLAESIPQLGLFISLVGAISSTTLALMFPPIIQLVSHYQNNNGLTVFITVKNLLIISLGLFIFVTGTYQSIASIVQAF-