Monarch geneset OGS2.0

DPOGS210839
TranscriptDPOGS210839-TA2388 bp
ProteinDPOGS210839-PA795 aa
Genomic positionDPSCF300027 + 81677-85887
RNAseq coverage1275x (Rank: top 10%)
Annotation
HeliconiusHMEL0213040.078.49% 
BombyxBGIBMGA003915-TA0.070.12% 
DrosophilaCG3409-PB9e-8045.66% 
EBI UniRef50UniRef50_F4WBF50.042.86%Monocarboxylate transporter 14 n=8 Tax=Formicidae RepID=F4WBF5_ACREC
NCBI RefSeqXP_393553.30.047.69%PREDICTED: similar to CG3409-PA [Apis mellifera]
NCBI nr blastpgi|3407170050.046.14%PREDICTED: hypothetical protein LOC100645876 [Bombus terrestris]
NCBI nr blastxgi|3320284680.043.31%Monocarboxylate transporter 14 [Acromyrmex echinatior]
Group
Gene OntologyGO:00550853.7e-24transmembrane transport
GO:00160213.7e-24integral to membrane
KEGG pathway 
InterPro domain[92-770] IPR0161968.1e-68Major facilitator superfamily domain, general substrate transporter
[141-319] IPR0117013.7e-24Major facilitator superfamily
Orthology groupMCL10863 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210839-TA
ATGGACACGCAAAGAACTCATGCAAATGGAATCCATGGCAAAGAAAAGCTAATCAATCATGAACATAAAATACCGTCTGAAGAGGAGAAGACTTCGTCTCCGCTGCTCGTTAAGACGAACGATTTTAAGTACATAGATTCATATAGTGAAGAAAGACATGACTTGAGCGAGGTTAACAAGTCTGGGAATGATCTCAATACCATAAATTCAATGAAGACGGAAGACGATGTGCAGTGTAACAGCTTGACTGTGAGTTATCACAATGACAAGAATGATGATGACGCCAAGTCGAACAACGAAAGGGTCAAGTTTGTGGGCCTGGAAGATGCTGATAGTATTTGCTCCAGTAAACAACCAGAACCGGGACCTCAAATACCTGATGGCGGTTGGGGTTGGGTTGTTGTGGCTGCATCCTTCCTAATCGCCACTGTGGCTGATGGACTGGCATTCTCATATGGATTGCTTCAAATTAAATTTGTCGATTTCTTTGAGAAAAGTGAAGCGAAGACTTCTTTGATTGGAAGTCTCTTTATATCGGTGCCTTTAATAGCGGGTCCCATAATGAGCGCTTTGGTAGATCGTTATGGATGTAAGAAAATGACAATTGTGGGGGGTATAGCTTCTACTATAGGTTTTGTTGCGGCATCGTACAGTAATACCGTCGAAGCTTTGTACATTACCTATGGTATAGTAGCTGGATTGGGCATGGGGCTTTTATATGTCACGGCAGTAGTTTCTATTGCGTACTGGTTTGAGAAGCGTCGTAATTTGGCTGTTGGATTGGGATCTTGTGGAGTCGGCTTTGGAACATTTATATACTCTCCTCTGACCACATATTTACTGGATGAATACGGCTGGAGAGGCGCCTTGTTATTGCTGGCTGGCACGGTACTTAACGTTTGTGTATGCGGTACAGTTATGAGGGATCCTGAATGGCTTATCTTAGAACAAAAGAAGCAGAGAAAACTAAGTAAATCGAAGAGAGGATCCAGCTCTACGTCGATATCTGCCAAATCCGGCGGTGCTGAATCCGTGTATCCCGGTGCTGAGGAACTGAAAAGCTTAATTAATAGTGGCGAAGCACCAGAGCACATACTCTCTGCCCTGATATCAGCTATTGCCGAGGCTGAAAATGTAGAGGCAGTGACGAAAATGAACGCTGATTTAACGCAACAACAGAAAGTCAGTTCTGTTTTAAATTTACCAACGTTTTTGAAACAAAGTGAAAGGTTGCCTCCAGAAGTTATAGATCAACTTGCGACTAACGAGCGGCTGTATAACATAATACTTCAAAATTATCCGTCGCTATTAGCTTTGAGGACTGGCTCAGATTCCAAACTTACCTCAGAACCGCTTTCAAAGGAGAAGCTTAAAAAGAAACAGAAGAAAGAGTTCGATAAGAAATTAGAAAGGGTAAAGGAGAAACTGTTGCAGCCAATTCCTGAAAACGGTCCAGCGGTGATAACACCTAAGACCTGGCATCAGGATTGGTTTTCCAAGCAGTTCCAAACAGATCATCACTATCTAAAAGGAATTCGGGTCCATCGTAACTCCATTATGCATCGCGGAGCAATGATGAACATCGCTAAATACAAGCTGCGGGCTTCATCTTGCCCCGATATCTATAAGAATTCAATGTGGTCTGTAGAGGATCAAGAGGAAAAGACCTGGGGCAAGAAGATTCTGCACGCTATTAACAAAACATTCGACTTCAACATGTTCACGGAGTTCCACTTCCTCATGATGAACCTTTCGACCCTGGTGCTGTTTATATGGTTTATTGTGCCATATTTCTACATTTCCACGTTCATGGAAGCCAACGGCTACAGCGAGACCCAAGGATCCTTGATGCTGAGTGTTTTCGGCGTCGCAACCATAATTGGCATCGTCGGACTGGGATGGTTTGGCGACCTTCCTTGGGTCAACATAACAAAGACATACGCGATGTGTTTAGTAGCTTGCGGTATAACGATCGTGCTCTTCCCTATCCTGATACGAGTTATGGACGCGAGGGAGAGCTACAGCTTCTACATACTGGCGCTGAACTCATTGTTCTTCGGATTAACCTTCTCCAGTTCGTATTCGTATACGCCGAGTATTCTGGTGGAATTGATAGCGCTGGAGCGTTTCACCATGGCGTATGGATTGGTGCTGTTGAGTCAAGGGATTGGACATTTGATTGGACCGCCCATGGCTGGGGCCTTAAAAGATAGGACTGGATACTGGGACGCGGCATTCTACGTCGCAGGAATTTGGGTCGTTGTCTCTGGTTTGTTGGTAGGTGTGATACCATATACGAAAAACTTTAGAATCATTGGAAATGCACCGCTGGCTAAAGACGTAGCTGTGGATCCTGAACCCGGTGTAAAGATTATTATAGCTCATTAG

Protein sequence:

>DPOGS210839-PA
MDTQRTHANGIHGKEKLINHEHKIPSEEEKTSSPLLVKTNDFKYIDSYSEERHDLSEVNKSGNDLNTINSMKTEDDVQCNSLTVSYHNDKNDDDAKSNNERVKFVGLEDADSICSSKQPEPGPQIPDGGWGWVVVAASFLIATVADGLAFSYGLLQIKFVDFFEKSEAKTSLIGSLFISVPLIAGPIMSALVDRYGCKKMTIVGGIASTIGFVAASYSNTVEALYITYGIVAGLGMGLLYVTAVVSIAYWFEKRRNLAVGLGSCGVGFGTFIYSPLTTYLLDEYGWRGALLLLAGTVLNVCVCGTVMRDPEWLILEQKKQRKLSKSKRGSSSTSISAKSGGAESVYPGAEELKSLINSGEAPEHILSALISAIAEAENVEAVTKMNADLTQQQKVSSVLNLPTFLKQSERLPPEVIDQLATNERLYNIILQNYPSLLALRTGSDSKLTSEPLSKEKLKKKQKKEFDKKLERVKEKLLQPIPENGPAVITPKTWHQDWFSKQFQTDHHYLKGIRVHRNSIMHRGAMMNIAKYKLRASSCPDIYKNSMWSVEDQEEKTWGKKILHAINKTFDFNMFTEFHFLMMNLSTLVLFIWFIVPYFYISTFMEANGYSETQGSLMLSVFGVATIIGIVGLGWFGDLPWVNITKTYAMCLVACGITIVLFPILIRVMDARESYSFYILALNSLFFGLTFSSSYSYTPSILVELIALERFTMAYGLVLLSQGIGHLIGPPMAGALKDRTGYWDAAFYVAGIWVVVSGLLVGVIPYTKNFRIIGNAPLAKDVAVDPEPGVKIIIAH-