Monarch geneset OGS2.0

DPOGS209501
TranscriptDPOGS209501-TA1215 bp
ProteinDPOGS209501-PA404 aa
Genomic positionDPSCF300127 - 63881-67738
RNAseq coverage50x (Rank: top 70%)
Annotation
HeliconiusHMEL0160172e-8552.30% 
BombyxBGIBMGA007344-TA6e-12959.36% 
Drosophila% 
EBI UniRef50UniRef50_UPI00020614EA2e-9746.06%UPI00020614EA related cluster n=1 Tax=unknown RepID=UPI00020614EA
NCBI RefSeqXP_972125.13e-10247.17%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|3504239682e-10247.60%PREDICTED: putative amino acid permease F13H10.3-like [Bombus impatiens]
NCBI nr blastxgi|3504239685e-10647.36%PREDICTED: putative amino acid permease F13H10.3-like [Bombus impatiens]
Group
KEGG pathwaymcc:7147061e-06 
 K08653 (MBTPS1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[1-396] IPR0130572.6e-22Amino acid transporter, transmembrane
Orthology groupMCL17374 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209501-TA
ATGGGGTCATCTCTTCTGACGATGGCGTGGGGCGTGGAGAGAGCTGGTCTCCCGGCCGCCCTTCTCTTACTGACAGTGATGGCTGGACTCTGTCTCTACACCGCGTACATCTTAATAAGAGTCAACGCACACCACGGTAGTTCTACATGTGAGGTTCCCGCCTTGTGTCTGTTACTGCTGGGTCGCTGGTGGTCCAGAATCGCCCACGCCTTCAGTCTCTTAGTCCTGTTGGGAGCGACCCTCGTCTATTGGCTGTTGATGGCCAACTTCTTATTTTACACCGTCAATTATTTCATGGATGTTTCCACATCAAACCAGACGACATACGATACCAATCTAGTGTGTCCAAAGCACGAGGTTTTCATGACGTCACCCCAAACCCCTCAAGACCCCTCTCCCTACTGGGGTCTACACACCACCGTGCCTGTTTATGTCGCACTTATAATGTTCCCATTGCTGTGCTTCAAAAACGTCACATTCTTCACTCGCTTTACTTCTTTTGGTACACTGTCAGTGATGTACCTCCTGTTGTTCGTGATAGTTAAGGGCTGGATGTGGGGAATCAATTTGAAGTCGATAGACATTAAATGGGAAGGTCTCCATGGCGCGAGGAACGCGGCCGTTCTCAGCGGGATGCTGGCTCTATCATTCTACATACATAATATTATCATAACGATAATGAATAACAACGCCAGACAAGAGAAAAACGGTCGTGACTTGACGATAGCATTTTTGTTGGTGACTGTAACATACACGCTCGTCGGCACGGTCTTCTTCATCTGTTTTCCGTTGGAGAAGAACTGTATTGAGGACAATATTCTCAACAATTTTGAAAAACACGACGTTATGACTGTGATTGCTAGAATGCTTCTTCTATTTCAGGTGATGACAGTGTACCCGCTAGTGGTGTGTCTGGTGCGCGCGGAGGCCGCTCGCCTGCACTCTCTCGCCTCAGCTCGCGGCGCCGACCTCGTCATCAACGGGAGCGTGGTGGTGGCGTGTATAGTCGTCGCGTGCATTTGTCCCTACATAGGGACTATCATACGTTACACGGGAGCCGTAAGCGGTCTGGTGCACGTGTTCGCGTTGCCCTCGCTACTTCACATGAGGTCGCTGCAGCTCAGAGGAAAACTAAACTCAATTAAAATTATTTTTTACTTCGCCATAATAATGTTCGGGACTGTAAACTTATTGATGCAGTTCTTTATACCATAG

Protein sequence:

>DPOGS209501-PA
MGSSLLTMAWGVERAGLPAALLLLTVMAGLCLYTAYILIRVNAHHGSSTCEVPALCLLLLGRWWSRIAHAFSLLVLLGATLVYWLLMANFLFYTVNYFMDVSTSNQTTYDTNLVCPKHEVFMTSPQTPQDPSPYWGLHTTVPVYVALIMFPLLCFKNVTFFTRFTSFGTLSVMYLLLFVIVKGWMWGINLKSIDIKWEGLHGARNAAVLSGMLALSFYIHNIIITIMNNNARQEKNGRDLTIAFLLVTVTYTLVGTVFFICFPLEKNCIEDNILNNFEKHDVMTVIARMLLLFQVMTVYPLVVCLVRAEAARLHSLASARGADLVINGSVVVACIVVACICPYIGTIIRYTGAVSGLVHVFALPSLLHMRSLQLRGKLNSIKIIFYFAIIMFGTVNLLMQFFIP-