Monarch geneset OGS2.0

DPOGS210149
Transcript        DPOGS210149-TA  1833 bp
Protein           DPOGS210149-PA  610 aa
Genomic position  DPSCF300465 - 36061-52250
RNAseq coverage   277x (Rank: top 39%)
Annotation
Heliconius      HMEL007475        7e-157  53.14%
Bombyx          BGIBMGA010161-TA  5e-148  50.83%
Drosophila      CG4797-PB         4e-51   27.60%
EBI UniRef50    UniRef50_Q7PX65   2e-108  40.43%  AGAP001236-PA n=1 Tax=Anopheles gambiae RepID=Q7PX65_ANOGA
NCBI RefSeq     XP_321919.4       4e-109  40.43%  AGAP001236-PA [Anopheles gambiae str. PEST]
NCBI nr blastp  gi|347965559      7e-108  40.43%  AGAP001236-PA [Anopheles gambiae str. PEST]
NCBI nr blastx  gi|347965559      2e-107  41.33%  AGAP001236-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0055085  4.2e-46  transmembrane transport
                  GO:0016021  4.2e-46  integral to membrane
                  GO:0022857  4.2e-46  transmembrane transporter activity
KEGG pathway 
InterPro domain   [100-574]  IPR016196  4.4e-52  Major facilitator superfamily domain, general substrate transporter
                  [144-575]  IPR005828  4.2e-46  General substrate transporter
Orthology group   MCL19581  Insect specific

Nucleotide sequence:

>DPOGS210149-TA
ATGGAGAAAGGAATAGCGTCTGTCAGCAGAGCGAGGGTGGTTTTGTCTCAGGTTGTAGCATGTTCCGCATTAAACGTGCTGTTGGTTGGTCTTGGAATGTCGATGAGTTTTGTGACCATGGTGCTGCCAGAAGTCCTCGACGCTAAAGAAGGATTGTCAATCAATAAGAATCAGGCTTCGTGGTTTGCCATTCTGGTCGCTCCACGATCTATCTCTTCTTATCCCAAATTGTGGTGGCTCAATATCAGTTCCATAAAATATAACACGTCTCATGACCGATTGATAGATTTGGCATTTGAACTTAAGGCGCGTCCGTGGGTTGTAGCATGTTCAGCATTAAACGTGCTGTTGGTTGGTCTTGGAATGTCGATGAGTTTTGTGACCATGGTGCTGCCAGAAGTCCTCGACGCTAAAGAAGGATTGTCAATCAATAAGAACCAGGCTTCGTGGTTTGGAAGTATGGCATTTCTATGTCAGCCTTTGGGGAGTATATTTTCTGGTCCACTATTGGATTACTTCGGGAGAAAGAAAGCTCTATTCCTCGTCAATATACCGCATCTATTTGCATGGTTGATGATGTATTTCGCGTGGGACGTCCCTAGCCTGTTTCTGGCCAATGCTTTTCTAGGCATCGGCATCGGCATTATGGAAGCACCGTCTATTACTTACGTTGGCGAAGTCAGTGATGCCTCTCTGCGTGGGACGCTTACAACTTTGACAAACGGTTTCACATCAGCTGGTATGTTCATGGCTTACCTCCTGGGAACAGTTGTGTCATGGCGCGAAGCAGCACTCGTTTCGCTCACTGTACCTCTTGCTACCATGTTATTAGTCCTTTTTGTCCCTGAAACTCCTATTTGGTTACTATCAAAAGGCAGGCAAAAAGAAGCACTGGTTTCGCTCTGTCGTCTTCGGGGTTGGGTTGAACCGGAGGATGTTAAAGAGGAATTTAACCAGTTAGTGGAATATAGTAACAACATAAGCAGATGTGTTCTGTGCACCAAAGTACAAGAGCTGGATAGTAAAATCTGTAAACATTCATCTTATAACTTTATGAAGAGATACATTCTTAGACTGAAGCATTTACTTTTTGTGAAAGAGACTATGAGACCGTTCGGATTAGTCATGGCGTATTTCTTTTTTTACACCATGAGTGGTCTCTTGCCTGTTAGACCTAACATGGTGAACGTGTGCAAGGCTCTGGGTATGAAATTTGACTCCAAAGCAATTGTGGTCAGCGTCGCATTGGTATATATTGTGATGAATATCGTATCAGCTGTCGTAGTTAAGATATTTGGGAAGCGTAAATTGATCTTATCATCACTCTTCGCTTCAGCTTGTAGCAGTCTTGCTTTGAGTATATATGCGGGAGTTGTGCTGCCGGTTAGCGTATTTTCATACGAACCGAGTACATTTCCAAGTCAAACGGAAATTATTCCTGTTATACTATTTATGTCGCTAGTGTGTTTCACCAGCTTAGGTATACCATGGATCCTACTCTCCGAAGTCTTTCCTTTCAGGAGTCGTGGTATGGCTACGGGTTTGGCTGCCGCTTTAAGCTACCTCATTTTCTTCGCAGCAGCAAAATCCAATTACAACATTGAGGAAAATTTCCACATGAGCGGCTCCTTTATGACTTATGCCATACTGGGTTTTATGGGCACGGTATATTTGTACTTCTTCCTTCCGGAAACTGAGCGAAAAACGTTAGCTGAGATTGAAGCGTTCTATAACGGGAAGTCGAAAATATTTGCAAATGATTTCGTTATAAACGCTTTCAGAAAAACAAAAATTGAGACGAACGGAGCCGACAAACCGATGCTAGACTGCTGA

Protein sequence:

>DPOGS210149-PA
MEKGIASVSRARVVLSQVVACSALNVLLVGLGMSMSFVTMVLPEVLDAKEGLSINKNQASWFAILVAPRSISSYPKLWWLNISSIKYNTSHDRLIDLAFELKARPWVVACSALNVLLVGLGMSMSFVTMVLPEVLDAKEGLSINKNQASWFGSMAFLCQPLGSIFSGPLLDYFGRKKALFLVNIPHLFAWLMMYFAWDVPSLFLANAFLGIGIGIMEAPSITYVGEVSDASLRGTLTTLTNGFTSAGMFMAYLLGTVVSWREAALVSLTVPLATMLLVLFVPETPIWLLSKGRQKEALVSLCRLRGWVEPEDVKEEFNQLVEYSNNISRCVLCTKVQELDSKICKHSSYNFMKRYILRLKHLLFVKETMRPFGLVMAYFFFYTMSGLLPVRPNMVNVCKALGMKFDSKAIVVSVALVYIVMNIVSAVVVKIFGKRKLILSSLFASACSSLALSIYAGVVLPVSVFSYEPSTFPSQTEIIPVILFMSLVCFTSLGIPWILLSEVFPFRSRGMATGLAAALSYLIFFAAAKSNYNIEENFHMSGSFMTYAILGFMGTVYLYFFLPETERKTLAEIEAFYNGKSKIFANDFVINAFRKTKIETNGADKPMLDC-