Monarch geneset OGS2.0

DPOGS210838
TranscriptDPOGS210838-TA2325 bp
ProteinDPOGS210838-PA774 aa
Genomic positionDPSCF300027 + 59539-64518
RNAseq coverage164x (Rank: top 51%)
Annotation
HeliconiusHMEL0213031e-16162.43% 
BombyxBGIBMGA003914-TA0.063.64% 
DrosophilaCG3409-PB1e-7660.59% 
EBI UniRef50UniRef50_F4WBF51e-17243.08%Monocarboxylate transporter 14 n=8 Tax=Formicidae RepID=F4WBF5_ACREC
NCBI RefSeqXP_393553.31e-17045.11%PREDICTED: similar to CG3409-PA [Apis mellifera]
NCBI nr blastpgi|3407170057e-17445.59%PREDICTED: hypothetical protein LOC100645876 [Bombus terrestris]
NCBI nr blastxgi|3407170055e-16947.09%PREDICTED: hypothetical protein LOC100645876 [Bombus terrestris]
Group
Gene OntologyGO:00550854.2e-31transmembrane transport
GO:00160214.2e-31integral to membrane
KEGG pathway 
InterPro domain[107-743] IPR0161964.7e-61Major facilitator superfamily domain, general substrate transporter
[141-320] IPR0117014.2e-31Major facilitator superfamily
Orthology groupMCL24954 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210838-TA
ATGCAATTGGATGAGAGTCCAAGCGTGGTTTTAAGAAAAAGCGCTCTTGGCGACAGAACGAAAGTGCAAGATGCTGTACGAGAATCTATCTTGACTGACTCTACTTCAGGGATAGTCAAAGGGGATAACGACTCCACAAGCATTGGTTCTTCGACTCCTTTCTCCAGTCCTGAATTAGACATAAGCGCCAAAGAACTTTTATCACCCAAAGTTAATAAGGACAGGAAAAGTGTACAGTACGAAGAAGATTTCGATGGAATCAATAATGATTTGGAAAAGAAAAGTGGTAAAAATCATGTAGATAAAACCGAAATAGTTGATGACGAAATAGCTAGAATTGAAAATGAAAAGCTTATTGGAGCAAATGACGTTACAAAAGTATCTATACCAGATGGGGGGTGGGGTTGGATAGTTGTTCTATCCTCTTTCGTAATATCTATGATCGCTGATGGCATTAGCTTCTCTTTTGGCTTGTTGTACATTGAATTTTTAGATGAATTTGGAGCAAGTAAATCAACAACAGCTTGGATTGGCAGTCTGTTTATAGCTGTGCCGCTTTTAGCTGGACCTGTGATGAGTGCTCTTGTAGACAGATACGGATGTAGAAGTATGACCATTTTAGGAGGGATTATTTCATCTTTGGGTTTCGTACTGGCTTCTGTGTCAACTACGTTGGAGACTATGATGCTCACTTTTGGAGTTATTGCGGGTCTGGGCTTAGGACTGGTATATGTAACAGCAGTAGTGTCCATAGCATATTGGTTTGAAAAGAAAAGGAATCTGGCTGTCGGCCTTGGGGCTTGTGGAACTGGTGTAGGTACATTTGTTTATGCACCGATGACTCAATATTTTATCGATGAATATGGTTGGAGAGGAACCATCCTTCTTTTGTCTGGTACGCTGTTAAATTTATGTGTCTGCGGTTGTGTTATGAGAGATCCGGAGTGGTGGATATTAGAGCAGAAACGGCAAAAGGAAGAAGAGAAGTCTACAAAAGATGACGTCAAATCTAAATATTCCAGCTCATATTCCGTTGCTAATGGTAATAATCCAAATGATTTTGAAATTCAAACGAAAGGGAAAAATATGAAAAGAATGATTAACAGAGGTGTATCACCTGAAAAGGTTATCAAAGATGAAGTGGCCAGCATACGCCGACTAGAAAATGTGATCTCTTCTCAATCAAATGACCGAAAATCTAGCTCATTACTGAATTTACCAACTTACTTAACATACAACGATAAGAAATCTTCCACATTTGTAGAGTGTTTATTAAATAAGTGGAATAATAAATCGGCGGTAAAATTTCCAACTAAAAACGAAGCTTTAGAAGAGAATGAAGAAAATCACTTAAATAGCAACAGCCCTAAAGAAATACTCAATAATGGCAAAATGCAGAGAACGTGCTCAGAACAGATAAATAGAAATAATGAGGAAATAGAGAGTTATAAGAGAAGGCATCCGAAGTTAGGGCAGATTAGAAAGTCTTTATCGGAAACGAAAGAAAACAGGCCATTATTGAAACAAGACAGCAAGGAAAGTAAACGAGAGTGGCTGAAGAAACAACTATCAGTGAATCATCATTATTTGAGAGATCTTAAAATGCCGATAAATTCATTAAGTCACAGGAACGCTATGTTGAATATAAAGAAATACAAATTGAAAGCTTCGTCATGTCCGGACATATTTAAAAACTCTATGATAACAGTGAATGAGCAAGAAGAGAAATGGTATGACGATTGTGTAAGTTGCCTTGGAGATATGTTCAACTTGTCTCTCTTCAAGAAGATTACATTCGACCTTCTTTGTATTGGGACCATAGTTATTTTCGTCTGGTTCATAGTCCCATACTTCTACCTCGCTGAACATATGATCCAGAATGGCTACTCCGAAGACGATGGCGCATTTATGTTGAGTCTGATAGGCGTCACCAACACTATCGGCATGGTCACCCTCGGCTGGATAGGAGATTTCCCGAATGTGGCAATTGGTAATCTTTACGCCCTTTGCCTTATTCTTTGTGGCGCCGCAGTCGCAGCTATACCCTTAGCTCCTAGCTATTGGATTCTCGCCTCTATTTGTGCTGCATTTGGGTTGCTTTTCGCAGCATCGTTTACTTTCACCCCTAGTCTTTTGGTGAAATTGGTTTCTCTGGATGATTTCACGTCTGCTTATGGATTGGTTCTGCTAGCTCAAGGTATTGGACATCTGATTGGACCGCCTTTGTCAGAAACTTCATGGTCTATACGAAAATATCAGCAGATACGCAATAAGCAGAGGCTTCTTAACGGAAAAGATGTAACCTTTCATCACGGCGACGCCTGA

Protein sequence:

>DPOGS210838-PA
MQLDESPSVVLRKSALGDRTKVQDAVRESILTDSTSGIVKGDNDSTSIGSSTPFSSPELDISAKELLSPKVNKDRKSVQYEEDFDGINNDLEKKSGKNHVDKTEIVDDEIARIENEKLIGANDVTKVSIPDGGWGWIVVLSSFVISMIADGISFSFGLLYIEFLDEFGASKSTTAWIGSLFIAVPLLAGPVMSALVDRYGCRSMTILGGIISSLGFVLASVSTTLETMMLTFGVIAGLGLGLVYVTAVVSIAYWFEKKRNLAVGLGACGTGVGTFVYAPMTQYFIDEYGWRGTILLLSGTLLNLCVCGCVMRDPEWWILEQKRQKEEEKSTKDDVKSKYSSSYSVANGNNPNDFEIQTKGKNMKRMINRGVSPEKVIKDEVASIRRLENVISSQSNDRKSSSLLNLPTYLTYNDKKSSTFVECLLNKWNNKSAVKFPTKNEALEENEENHLNSNSPKEILNNGKMQRTCSEQINRNNEEIESYKRRHPKLGQIRKSLSETKENRPLLKQDSKESKREWLKKQLSVNHHYLRDLKMPINSLSHRNAMLNIKKYKLKASSCPDIFKNSMITVNEQEEKWYDDCVSCLGDMFNLSLFKKITFDLLCIGTIVIFVWFIVPYFYLAEHMIQNGYSEDDGAFMLSLIGVTNTIGMVTLGWIGDFPNVAIGNLYALCLILCGAAVAAIPLAPSYWILASICAAFGLLFAASFTFTPSLLVKLVSLDDFTSAYGLVLLAQGIGHLIGPPLSETSWSIRKYQQIRNKQRLLNGKDVTFHHGDA-