Monarch geneset OGS2.0

DPOGS205643
TranscriptDPOGS205643-TA1404 bp
ProteinDPOGS205643-PA467 aa
Genomic positionDPSCF300023 - 99412-108404
RNAseq coverage862x (Rank: top 15%)
Annotation
HeliconiusHMEL0058614e-17970.31% 
BombyxBGIBMGA000844-TA2e-13248.90% 
DrosophilaCG8785-PB3e-10644.40% 
EBI UniRef50UniRef50_Q7K2W34e-10444.40%CG8785, isoform A n=18 Tax=Endopterygota RepID=Q7K2W3_DROME
NCBI RefSeqXP_001648128.11e-11649.05%amino acid transporter [Aedes aegypti]
NCBI nr blastpgi|1571037832e-11549.05%amino acid transporter [Aedes aegypti]
NCBI nr blastxgi|1571037835e-11449.05%amino acid transporter [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[58-457] IPR0130571.1e-71Amino acid transporter, transmembrane
Orthology groupMCL34617 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205643-TA
ATGGCTAATAGTTACAAGATGAAGGAGTTCAGCTCCACGGCTGTGATCACTGAAAATGCCGTCTTCCCGTCCACAATATCCATCAACACGATGAACACAAAGTGCAAAGAAACTGAGATAGAGGACGCGTACGATCCCTTCCAGAACAGAAAGCTGGAGCATCCGAACTCTGACGTCCGTTCGTTTGCAAATCTTCTAAAATCATCGCTCGGTTCAGGTATCCTAGCGATGCCGGCAGCATTCAAGAACGCAGGCACTGTCGTCGGTATATTTGGCACAGTAATACTGGGATATATTTGCACTCACTGTGTTTATTTATTGGTGAAAACATCACAAGATGTATCAAAAGTGACGAAAGTTCCATCGCTTGGATACGCTGAAACAGTGGAAGCTGTATTTGCTACAGGACCCCAGCCTCTTCGAAAACTATCAAGAGCTATGAGAATATTTATAGATTGGGCGATGGCCTTCACAATTCTTGGCGCCTGCGCGGTTTATGTCATACTAATAGTGGAATCCGTTAAACAGATAGTGGACCATTTCCATCCTGACAGTGGTATTACTACGACAATGTACTGCCTAATGTTTCTTGTTCCAATTCTAATATTCACACAGATAAAAAACTTAAAATACATAGCACCTTTTTCTGGATTTGCTAATGTACTTTTAGTGTTAACTTTTTTAATCTGTCTCTATTACATTTGTTCTGAATTCCCAAGTTTCGATTCACAACCAATGTCTGTAGAAATTGGTAAATTGCCGCTTTTTATCGGTACAGTTATATTTGCTATGGAAGGTATCGGTGTAGTGTTGCCAGTTGAAAACACCATGGCTAAACCTCAGCACTTTCTTGGTTGTCCCGGAGTCCTAAACATTACCATGGCAATAGTGGTCCTACTATATATGGTAATGGGAATTCTGGGCTATTTAAGATACGGTGACAAAGCTGAAGGCAGTATTACGATTAATCTTCCTACACAAGAAATACCAGCATTGATGGCTAAGGTGTTTATAGTCTTGGCTATATTTTTCACCTACGTTCTACAGTTTTATGTGCCCATGGAGATTGTTTGGCGCAACACGAAAGAAAAGGTCTCTCAAAAGTATCACAACCATGCTCAAGCCATTATAAGAGCCTTCTTTGCAGCTCTGACCGTCGTAGCCGCAGCCTCCTTACCAAAACTCGAACAAGTAATAGGACTAGAGGGTGCTTTCTTCTACTCTTTTCTTGGACTGGTGGCGCCCTCACTGATGGAAATCATTTTCTGTTGGGACAGAGGGCTTGGCAAATACAACTATATTTTAATAAAAGATAGTATTCTTGCCATATTTGGCATGTTTGTACTTGTTACCGGCGTTATGCAAAGTATTAAGGAAATTATCAGAACTAACGGATCTTTGTGA

Protein sequence:

>DPOGS205643-PA
MANSYKMKEFSSTAVITENAVFPSTISINTMNTKCKETEIEDAYDPFQNRKLEHPNSDVRSFANLLKSSLGSGILAMPAAFKNAGTVVGIFGTVILGYICTHCVYLLVKTSQDVSKVTKVPSLGYAETVEAVFATGPQPLRKLSRAMRIFIDWAMAFTILGACAVYVILIVESVKQIVDHFHPDSGITTTMYCLMFLVPILIFTQIKNLKYIAPFSGFANVLLVLTFLICLYYICSEFPSFDSQPMSVEIGKLPLFIGTVIFAMEGIGVVLPVENTMAKPQHFLGCPGVLNITMAIVVLLYMVMGILGYLRYGDKAEGSITINLPTQEIPALMAKVFIVLAIFFTYVLQFYVPMEIVWRNTKEKVSQKYHNHAQAIIRAFFAALTVVAAASLPKLEQVIGLEGAFFYSFLGLVAPSLMEIIFCWDRGLGKYNYILIKDSILAIFGMFVLVTGVMQSIKEIIRTNGSL-