Monarch geneset OGS2.0

DPOGS210156
Transcript        DPOGS210156-TA    1095 bp
Protein           DPOGS210156-PA    364 aa
Genomic position  DPSCF300379 - 178325-181870
RNAseq coverage   11x (Rank: top 84%)
Annotation
Heliconius      HMEL022059        2e-76  44.05%
Bombyx          BGIBMGA004081-TA  8e-58  44.79%
Drosophila      CG30344-PA        6e-43  27.56%
EBI UniRef50    UniRef50_Q961H2   9e-41  27.56%  CG30344 n=22 Tax=Drosophila RepID=Q961H2_DROME
NCBI RefSeq     XP_002089754.1    1e-42  28.21%  GE19261 [Drosophila yakuba]
NCBI nr blastp  gi|195474960      2e-41  28.21%  GE19261 [Drosophila yakuba]
NCBI nr blastx  gi|195474960      3e-44  28.30%  GE19261 [Drosophila yakuba]
Group
Gene Ontology    GO:0055085  6.3e-14  transmembrane transport
                 GO:0016021  6.3e-14  integral to membrane
                 GO:0005886  4.5e-06  plasma membrane
                 GO:0005215  4.5e-06  transporter activity
KEGG pathway 
InterPro domain  [6-313]   IPR016196  6.2e-20  Major facilitator superfamily domain, general substrate transporter
                 [41-269]  IPR011701  6.3e-14  Major facilitator superfamily
                 [67-87]   IPR001958  4.5e-06  Tetracycline resistance protein, TetA/multidrug resistance protein MdtG
Orthology group  MCL27823  Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210156-TA
ATGGTCAATAGCCTCAACCCATGGTGGCTACTTTTGACTTCCATCCCGTTTTCATTATCCGGAGGCAATGTTGTATTATTCACGGGTGCCTACTCCTTCATTAAGGATACCAGTTCTCCAATTGACGCCTCATTATGCCTCGGAACTCTTGTGGGCTCTTTGTCTAGTTCCTACCTAATAGTACACATGGGTTATTCTTACGTACTAATATTTACCGCCACAGTAAACGTTATTACATATTTGTTTGTAAAGATTTGGATCATAGAGTCCTTGTCTGGGGCTGAACAGGGCCGAGCGGGAAGTTTGTTTAACTGCTCGCATCTTAAAGATATGTTAAAAGGATGTTTTAAACATCGTCCTAACAAAGGAAGAAATATTATCATACTTATGGCAATAATAAAATTAGTTCTTATCACGGTACAAGTCGGGTGGAGTTATTTGGAATATTTATATTTGAGAAATAAACTGAATTGGTCTTTAAGGGTTTACACAACATATTCAGCTGTTAGCACAATAACAGCTTTTTTTGGAGCATTTCTTGGTGTCATGGTAATTGAAAGACTACTGCGGATTGGAGATATAACATTTGTGATGATTGCATTAACGACGGCAATTATAGATTACATGATAAAAGCTTTTGCAACGCAATGGTGGCAAATATATTTGAGCATATTTGTGTCACCATTCAAAGGACTTCCGTTACCATTAATAAATTCATATGTCAGTAAATTTTTGCCAGAAGAGGACATAGCGAAGGTGTTCGCCCTCCTTTGCGCGATGGAGAGTGTAGCGCAAATTATTGCACCTATTATTTTTAATTCCCTATACTCGTCTACATTATCTGTATTCCCTGGCGCCATCTACATTCTAAGCGCAGTTATGAATGTCATCTGTTTGGTTATGTTAATTCAGGTTGTTTCTAAAGTGAGTTCCCAAGAATTTGGTACCGAAACCCGGGAAATAAAATTGCTGTATGGGAGACACTGTGAAGATTTTTACAATAGAACACAGAAAGTAGAAAAGTTGCGAAGATATTTCGCTAATCATGAAATTAACCAGACTTTACAGTTCAAGATCATCAAGGAGAATGAATAA

Protein sequence:

>DPOGS210156-PA
MVNSLNPWWLLLTSIPFSLSGGNVVLFTGAYSFIKDTSSPIDASLCLGTLVGSLSSSYLIVHMGYSYVLIFTATVNVITYLFVKIWIIESLSGAEQGRAGSLFNCSHLKDMLKGCFKHRPNKGRNIIILMAIIKLVLITVQVGWSYLEYLYLRNKLNWSLRVYTTYSAVSTITAFFGAFLGVMVIERLLRIGDITFVMIALTTAIIDYMIKAFATQWWQIYLSIFVSPFKGLPLPLINSYVSKFLPEEDIAKVFALLCAMESVAQIIAPIIFNSLYSSTLSVFPGAIYILSAVMNVICLVMLIQVVSKVSSQEFGTETREIKLLYGRHCEDFYNRTQKVEKLRRYFANHEINQTLQFKIIKENE-
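
The transcript and protein lengths reported in this record (1095 bp and 364 aa) can be cross-checked with simple arithmetic: a complete coding sequence should contain three bases per residue plus one stop codon. The sketch below is illustrative only. The function names `strip_stop` and `cds_matches_protein` are hypothetical and not part of any database tooling, and the check assumes the transcript is a complete CDS with no untranslated regions.

```python
def strip_stop(protein: str) -> str:
    """Drop a trailing stop marker ('-' or '*') from a protein string."""
    return protein.rstrip("-*")


def cds_matches_protein(cds_bp: int, protein_aa: int) -> bool:
    """Return True if a CDS of cds_bp bases encodes protein_aa residues
    followed by exactly one stop codon (assumes no UTRs are included)."""
    return cds_bp % 3 == 0 and cds_bp // 3 == protein_aa + 1


# Lengths as reported in the record: 1095 bp transcript, 364 aa protein.
print(cds_matches_protein(1095, 364))  # True: 1095 / 3 = 365 codons = 364 + stop
```

To run the same check on the sequences themselves, pass `len(nucleotide)` and `len(strip_stop(protein))`, since the protein sequence here ends with a `-` stop marker.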