Monarch geneset OGS2.0

DPOGS207447
TranscriptDPOGS207447-TA1932 bp
ProteinDPOGS207447-PA643 aa
Genomic positionDPSCF300051 - 428319-439056
RNAseq coverage1794x (Rank: top 7%)
Annotation
HeliconiusHMEL0148560.076.06% 
BombyxBGIBMGA001190-TA0.072.88% 
DrosophilaSP1173-PA5e-10632.67% 
EBI UniRef50UniRef50_UPI00015B50946e-13640.84%UPI00015B5094 related cluster n=1 Tax=unknown RepID=UPI00015B5094
NCBI RefSeqXP_001605410.11e-13640.84%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3071770623e-14543.02%Macrophage MHC class I receptor 2-like protein [Camponotus floridanus]
NCBI nr blastxgi|3071770629e-14843.22%Macrophage MHC class I receptor 2-like protein [Camponotus floridanus]
Group
Gene OntologyGO:00550851.3e-08transmembrane transport
GO:00160211.3e-08integral to membrane
KEGG pathway 
InterPro domain[1-599] IPR0161962.1e-28Major facilitator superfamily domain, general substrate transporter
[319-570] IPR0117011.3e-08Major facilitator superfamily
Orthology groupMCL15664 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207447-TA
ATGAATCAGCTCGTGAATAAGAATCTCATTACTCTTAAGTGCGTGCTCTTCTGTTTCCTCGCCGGTATAGGATGTATCTTCCCATATCTTCCGCTGCACATGCTGCACATTGGTTTGGACCGCGGTGAGGCGAGACTTGTGTCAGCGATCGCGCCCTGTATCGCACTACTTGGACCCGCCATACTTGGACCGCTTATTGACAAGCTATCAATAGGTCGTGGCTCCACCGGCGGTGGGACCGGGCCGAGCGGCTCTGGTCGCCTACTCCGTATTGTAACAGCTGTCTGTTTAATACTGAGCGCTGTCTTCTACACGCTCCTGCTGGCTGTGCCTTACACTGAACGACATGAAGCTCGCCGGCCTCAAGTGCTGTTCATGTGTGACGCGTCTGGTGCGTACGTGATGCAGGAGGTCTGTGGAGAAGGGATGCAGTGCAAACGTTGGCAGGGAGAAAAGTCTGGTGTGTTGGCTGTGAGTGCGTGCGAGTACGGCTGTGCGGATGACAACCTCACCTGGGTGATGCGGCCATTCACCACAACATCGACCACCACCACCCTCAGCCCCATGTACAACAGCGTTGCCAACGCCACTACCCCTTCAGATCTGGTGACCGAGGAACCGGAAGACGAGGACTACTTCGAACTGAACCCTCCACACCTGTGCTACAACGGACAGTGCTTAGTGTACATGCAACATTCCGCCAGGCTGAGAGTTCCGCTGTCACTGCTGGCACCAGAACCGCCAGGAGAGAACTCCACAGTCGAGAACAATTGGTGCACATATCGGACGGGTGGTGCGTCAAAGTGTTTGGTACCGCCGAGTCGTTTGGCGGAGATCTCAGTGGAGGGTGAAACGTGTAAGCCCGCGGTCAGGTGTCAGGTCATGGATCCCTACGACGAACCGGACGGAGTGTTGGCGGACGCTGAGTGTAGGCTCGTTGTCGGAGAACCCACCACCACCTTCTGGACTTACCTCGTCATCAGGGTGTTGGCGGATATATGGCCAACCGCCGGGCTGGCGTTACTAGGCGCGGCCTGCGTGATCGCTACTAGAGAGACTTCGCTGGGTCGTGGCGATGTTGGCCGGCAGTTGGCGTTCGGGACACTAGGGTTGGCCATATTCCCGCCACTGGCCGGGTACGCCGGCGAACAGATGACGGAATCTCCGTACCTGGTCCCCTTCCTGTTGCACGCCGTGTTCATGGTCATTGGAGCTCTCATCCTTTTGTGCGACACTCACATGCCGCTGTCGACCCCTGAGTGGTGGTGGCACACAGCGACCGGGGTGCTGGCGCTGCCGATGTCGGCTGTCCGGAGATACGGCGCTGAAACAGCAGCCGTTAGCGCCGTGCTGGTGCTCCTGGGAACACTCTGGAGCGGCATCGACGCTTACTTACCATGGACCGTGTTCCAACTCAACGGAACCCTAACGGAGGTTGGTTTGACTTTGACTGCCGGTTCGCTGCCCGCCCTGCCGGCGCTGTTCTGGGCGGAGGCTCTCGTGGACTACGTGGGACATTCTAACCTCTTCATCACAGCCTTCACCTTCTACTGCCTACGATACACGGGTCTAGCATACGGTGACTCTTACACCTGGATCGTTGTGTGCGAGCTGCTGGAGGTGTTCACTCTCAGCCTGGTGTGGGTTACGGCCATGTTGTACTTCCGACATCTGGTGCCCAGAAAATACACCACCACTGGCCAAGCGCTGCCCGTTATAGCACATTTCTGTATTGTGCTGGCGTTGTTGGTGGCGGCGGTCTACCTCGCGCTGTACCACCTCCTGCTGGCGCCGCGCTGCGCCTCGCCAGCTCAATCCCCTCCCAACCATCTCTTACAAGGCCTGAACACCAACGGCAGTTCGAACGGCAACTACTCTCCGATGCGAGTGTATCACGAGGAACGTTCCAGAAAGGGACATTTCCGCTATTAA

Protein sequence:

>DPOGS207447-PA
MNQLVNKNLITLKCVLFCFLAGIGCIFPYLPLHMLHIGLDRGEARLVSAIAPCIALLGPAILGPLIDKLSIGRGSTGGGTGPSGSGRLLRIVTAVCLILSAVFYTLLLAVPYTERHEARRPQVLFMCDASGAYVMQEVCGEGMQCKRWQGEKSGVLAVSACEYGCADDNLTWVMRPFTTTSTTTTLSPMYNSVANATTPSDLVTEEPEDEDYFELNPPHLCYNGQCLVYMQHSARLRVPLSLLAPEPPGENSTVENNWCTYRTGGASKCLVPPSRLAEISVEGETCKPAVRCQVMDPYDEPDGVLADAECRLVVGEPTTTFWTYLVIRVLADIWPTAGLALLGAACVIATRETSLGRGDVGRQLAFGTLGLAIFPPLAGYAGEQMTESPYLVPFLLHAVFMVIGALILLCDTHMPLSTPEWWWHTATGVLALPMSAVRRYGAETAAVSAVLVLLGTLWSGIDAYLPWTVFQLNGTLTEVGLTLTAGSLPALPALFWAEALVDYVGHSNLFITAFTFYCLRYTGLAYGDSYTWIVVCELLEVFTLSLVWVTAMLYFRHLVPRKYTTTGQALPVIAHFCIVLALLVAAVYLALYHLLLAPRCASPAQSPPNHLLQGLNTNGSSNGNYSPMRVYHEERSRKGHFRY-