Monarch geneset OGS2.0

DPOGS201158
TranscriptDPOGS201158-TA1767 bp
ProteinDPOGS201158-PA588 aa
Genomic positionDPSCF300065 + 190727-193726
RNAseq coverage1001x (Rank: top 13%)
Annotation
HeliconiusHMEL0177816e-15387.06% 
BombyxBGIBMGA003955-TA0.083.19% 
DrosophilaMct1-PA8e-7433.45% 
EBI UniRef50UniRef50_UPI00022C92B30.061.59%UPI00022C92B3 related cluster n=3 Tax=unknown RepID=UPI00022C92B3
NCBI RefSeqXP_970199.20.060.10%PREDICTED: similar to monocarboxylate transporter [Tribolium castaneum]
NCBI nr blastpgi|3504087190.061.59%PREDICTED: hypothetical protein LOC100748998 [Bombus impatiens]
NCBI nr blastxgi|3287890630.061.63%PREDICTED: hypothetical protein LOC552505 [Apis mellifera]
Group
Gene OntologyGO:00550852.2e-25transmembrane transport
GO:00160212.2e-25integral to membrane
KEGG pathway 
InterPro domain[5-576] IPR0161961.3e-58Major facilitator superfamily domain, general substrate transporter
[4-154] IPR0117012.2e-25Major facilitator superfamily
Orthology groupMCL15764 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201158-TA
ATGGCGATGCCGCTGCTGTCCGGACCCATTGCTAGTTTCTTGACAGATAGATACGGCTGTCGACGCATGACAATATTTGGCTCGATATTGGCGTTTCTGGGATTCGTCATTTCGGCATTCGTCGACAATATGGAGACGTTATTCCTGACCTTTGGCATTATGGCCGGATTTGGATTGAGTTTGTGTTACGTAGCCGCTGTGGTTATAGTGGCTTACTACTTTGAGAAGAAACGTTCTCTGGCGACCGGTATATCGGTTTGCGGGAGCGGTATCGGAACTTTTGTATTCGCACCCTTAACATATATTTTACTAGACGAGTATGGGTGGCGGGGAACGACTTTGATACTCGCTGGGTTTTTCCTGAACATGGCTGTTTGTGGCTTGTTATTCCGAGATTTGCCATGGACTACTACTATGAACGCTGAGAAAGCGAGGGAACGAAAAAGGCGGCGAGAAAGAAAGAGAAACAAACGTTTTGGATCCTCAGTGGATAGTCTTTCTGATAGTAAAAGTAGCGCCGCCGGGTCCGCTAAAGTGACAGACACCGTGATAGAGGAGGCCTCAACGTCAATCGTACCTCAATTCAGTTCTTTGGTCGATTTGCCGACATTTATGACCGGAGGAGAAGGGGTTTCCTTAGAAGTATTCGAATTAATGTCCAACAGAGGGCGAGCGTATGCGTTACTTGCGCAAAACTATCCCGGGGTGATGTTGCCTTCTAGAAGTTTTAGTGATAGTGGGCGGTTACACGAACAATCTCCGCCAAAAGCTATCCTGTCCCCTGGTACAGTTTCTCCCGGGACACTGTCCCCAACTTCGGCTCCCAGTGCTCGAGAACAGAGCAACACAAGTACGAATGTTCACGAGAAAGCGGCTCTATCTTTGTGGTTGAGGAGACAGGCGGGTTCTACAAAAAAACCGCCAGCCTTCCTAAAGGACTTGCGAGTCCACAGACACTCGCTTACATACAGAGGTGCTATGCTTAATATAAACCGGTACAGACTCCGAGCGTCTTCCTGTCCCAACATATTCAGGAATTCCATGACGACGATTGCCAAAGAAAAGGTGCAGTGGTACGCTGGTTTATGGGACTTCTGGGACCTGGCAGTAGACATGCTGGATTTCTCTCACTTCCTGAACCCCGCCTTCCTCGTCTTCGCCATATCAAACTTCCTCCTCTACATGTGGTATGACGTGCCGTATGTTTACATCGCGGATAACGCTTTGAACATGGGCTTCAACGAGTCTCAAGCCTCTATGCTGATATCCATTATAGGAATATTGAACATGTTTGGTGAGATCCTACTGGGCTGGGTGGGCGATTGGAGTTGCGTGTCGCCTATCGTGGTGTACGCAGGTTGTATGTTGTTGTGTGGGCCGGTGACCCTGGTGATGCCTCTTCTGGATTCGTACCCGAAGGTCGCTGCGGCCGCCGGAGCCTTCGGAGCCTTCATCGCGGCTAACTACTCTCTCACCAGCATCATCCTAGTGGAACAGATCACCTTGGAGAAGTTCACGAACGCTTACGGATTGCTCCTGCTTATACAGGGTGTTGCCAATCTGGTGGGACCGCCGCTTGCAGGGTGGGTGTACGACATCACAGACTCCTATGACCTATCTTTCTACCTGGCTGGAGTGTTCATAGCCATCTCCGGTGTGATCCTCCTGATACTTCCCATATACAACATGGCCAAGAGATACAAGCAGAGGAAGTTGAGGAAGACCCAGCTGAATGGACTGCATCATAACGGAAAACTTCTACCATAG

Protein sequence:

>DPOGS201158-PA
MAMPLLSGPIASFLTDRYGCRRMTIFGSILAFLGFVISAFVDNMETLFLTFGIMAGFGLSLCYVAAVVIVAYYFEKKRSLATGISVCGSGIGTFVFAPLTYILLDEYGWRGTTLILAGFFLNMAVCGLLFRDLPWTTTMNAEKARERKRRRERKRNKRFGSSVDSLSDSKSSAAGSAKVTDTVIEEASTSIVPQFSSLVDLPTFMTGGEGVSLEVFELMSNRGRAYALLAQNYPGVMLPSRSFSDSGRLHEQSPPKAILSPGTVSPGTLSPTSAPSAREQSNTSTNVHEKAALSLWLRRQAGSTKKPPAFLKDLRVHRHSLTYRGAMLNINRYRLRASSCPNIFRNSMTTIAKEKVQWYAGLWDFWDLAVDMLDFSHFLNPAFLVFAISNFLLYMWYDVPYVYIADNALNMGFNESQASMLISIIGILNMFGEILLGWVGDWSCVSPIVVYAGCMLLCGPVTLVMPLLDSYPKVAAAAGAFGAFIAANYSLTSIILVEQITLEKFTNAYGLLLLIQGVANLVGPPLAGWVYDITDSYDLSFYLAGVFIAISGVILLILPIYNMAKRYKQRKLRKTQLNGLHHNGKLLP-