Monarch geneset OGS2.0

DPOGS214806
TranscriptDPOGS214806-TA1815 bp
ProteinDPOGS214806-PA604 aa
Genomic positionDPSCF300059 + 54634-68693
RNAseq coverage4x (Rank: top 89%)
Annotation
HeliconiusHMEL0221373e-8945.00% 
BombyxBGIBMGA012111-TA1e-12049.03% 
DrosophilaCG2930-PB3e-7335.91% 
EBI UniRef50UniRef50_Q16YY53e-7736.69%Oligopeptide transporter (Fragment) n=1 Tax=Aedes aegypti RepID=Q16YY5_AEDAE
NCBI RefSeqXP_001843841.19e-7938.11%oligopeptide transporter [Culex quinquefasciatus]
NCBI nr blastpgi|3407161581e-7733.33%PREDICTED: peptide transporter family 1-like [Bombus terrestris]
NCBI nr blastxgi|3407161582e-7732.99%PREDICTED: peptide transporter family 1-like [Bombus terrestris]
Group
Gene OntologyGO:00160207.3e-29membrane
GO:00068577.3e-29oligopeptide transport
GO:00052157.3e-29transporter activity
KEGG pathway 
InterPro domain[136-539] IPR0001097.3e-29Oligopeptide transporter
Orthology groupMCL21052 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214806-TA
ATGAAGTTGTCCGTTAAGTTCGATGAGAGGGAAGAATTAGATGGAATATCGCAGGGGCGAAATATTATAAGTTCCAACTTTTTCGCGATTTCATATTGCGATGTATGCCGTCCAATCATAGATAGTAGCATGTACCTTACATTTTCATTCCCGTGTCTCTTGAAAGTCTGGAGTTGGCCAAAAGCAGACCGAGCTTTTATGACGTCAGACCAAACAATATCGCTAGTAGCTTCCTGCAACCGTAGTAAGTCGATGTCACCAAATTTGTACTGCAGAAGAATTCTGGATTTCATCTTCGGGGAAGATAACAATGCAGTGCAACAAGCGAAACCGGTCACACATAGATTTCCAAAAACAGTCATCGTTGTTCTTCTAATCGTTGTCTTCGAACGTTTTTCATTTTCGCTGAGTAGAGTTATATTTGTCAGCGCCAAGAATTGGTATATTATGAAATCGGAGGGTGATAAAGTCATTCGTTTCGTGAAATGCATATTGTTGGGTGTTAGGAATAAAGTTCTCAGAAAGAAAAGTGAACATCAAAATTGGCTAGATTCTACTAAACCTACGTATGATAGAGGGTTTATACAGGATGTTAGAAAGACAATTTCTATCTTAACAATGTTCACTGCTCTTCCAATGTTTTGGGCACTATTGGATCAATTGGGTTCGAGGTGGACGCTACAAGCCACTAAAATGGACGGACGATTTGGATATATTACAATCAAACCAGACCAGTTGCCTGTATTTGATACTATGTTTATTCTCATTCTCATCCCATTCTCACAACAGTATGTATACCCGTTTCTGACGACACGTTCGATACTCACAAATCCACTACACAAGCTGACTTTAGGCGGAGTGTTGGCAGCGCTTGCATTTGTTTTCTCCGGAATCGTAGAGATTTATATTAAACCAACTTACCCTGTACTGCCTGAACCTGGATTTTCACAACTCCGTATATATAATGGCAATCCCTGTTCAATATCCGTTCAAAGTGACCAGCAAAATATTATGTACACTATTCCATCTCTTTCACATTTCACAAACAAACAAATGAATGTGAAAAATACGACGGAGGTTCGATTCAGATTTGACGGGAGTTGCATCGAAGCCAAAGATGAAATATTTTCTTTGGAAGACAATATGGCTATATCGTTTTTTATATCTGGCAAAAATATTGAAAGGTTTAAAGAGAATGTAGATAAAAGTAAGTCTGGGCTACCAGTTGTTAGATTCCTCCTAACAGATCATATAGATACTTCAAATTTGAGTCTTTTCAATGAAATTCATAAAGAAGTTGAGGTGTATCTCTCCGCCGGAATCAGTTCACAAATGGAAGTATTTATGGCAGAATATACTATAAGAGATGGTGGTACCATAATTGCTCGAGAAATAAAAATGGATAATGGTGGAGTATATAATATTATAATGGAGAAAGTTGATGACAATTACGAAACAAATGTGATAACAATAACGCAACCCAACTCTATAACAATGGCGTGGCTGCTGCCACAGTTACTGATGATCGCTATGGCCGAGGTGTTGTTTGCGATTACCGGTAGTGAGTTCATATTCAAAGAGGCTCCGAAAAGCTTGAAGTCAGTGATGACTGCCGCCTGGTTGATTATTGAAGCTATAGGGAACATTATCATCATTGTTATTACAAGAATTTTTATCGATTATCCACAGGAAACTCAAACGTTCATTTATGCTGGATTAATGTGCATTTCGATATTGATATTTCATTTACAATCTAAGAATTACCAGTTTCGTACTTGTGACGAATTTAAGGCCGAAGAATATAGTGAGCAGTAA

Protein sequence:

>DPOGS214806-PA
MKLSVKFDEREELDGISQGRNIISSNFFAISYCDVCRPIIDSSMYLTFSFPCLLKVWSWPKADRAFMTSDQTISLVASCNRSKSMSPNLYCRRILDFIFGEDNNAVQQAKPVTHRFPKTVIVVLLIVVFERFSFSLSRVIFVSAKNWYIMKSEGDKVIRFVKCILLGVRNKVLRKKSEHQNWLDSTKPTYDRGFIQDVRKTISILTMFTALPMFWALLDQLGSRWTLQATKMDGRFGYITIKPDQLPVFDTMFILILIPFSQQYVYPFLTTRSILTNPLHKLTLGGVLAALAFVFSGIVEIYIKPTYPVLPEPGFSQLRIYNGNPCSISVQSDQQNIMYTIPSLSHFTNKQMNVKNTTEVRFRFDGSCIEAKDEIFSLEDNMAISFFISGKNIERFKENVDKSKSGLPVVRFLLTDHIDTSNLSLFNEIHKEVEVYLSAGISSQMEVFMAEYTIRDGGTIIAREIKMDNGGVYNIIMEKVDDNYETNVITITQPNSITMAWLLPQLLMIAMAEVLFAITGSEFIFKEAPKSLKSVMTAAWLIIEAIGNIIIIVITRIFIDYPQETQTFIYAGLMCISILIFHLQSKNYQFRTCDEFKAEEYSEQ-