Monarch geneset OGS2.0

DPOGS211576
TranscriptDPOGS211576-TA1482 bp
ProteinDPOGS211576-PA493 aa
Genomic positionDPSCF300084 - 319890-330320
RNAseq coverage196x (Rank: top 48%)
Annotation
HeliconiusHMEL0169901e-12371.48% 
BombyxBGIBMGA006540-TA1e-4025.74% 
DrosophilaCG15890-PA4e-4024.18% 
EBI UniRef50UniRef50_UPI00020617A84e-4026.38%UPI00020617A8 related cluster n=1 Tax=unknown RepID=UPI00020617A8
NCBI RefSeqXP_001604312.15e-4427.27%PREDICTED: similar to ENSANGP00000027535 [Nasonia vitripennis]
NCBI nr blastpgi|1565520471e-4227.27%PREDICTED: proton-coupled folate transporter-like [Nasonia vitripennis]
NCBI nr blastxgi|3838550171e-4426.58%PREDICTED: proton-coupled folate transporter-like [Megachile rotundata]
Group
Gene OntologyGO:00550851.4e-16transmembrane transport
GO:00160211.4e-16integral to membrane
KEGG pathway 
InterPro domain[29-476] IPR0161961.7e-19Major facilitator superfamily domain, general substrate transporter
[106-426] IPR0117011.4e-16Major facilitator superfamily
Orthology groupMCL34898 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211576-TA
ATGACTGAAAGTTCTATAGATTTACCGGAATTGGAGAGATTAAGATCGACAAATACAGACGGAGTTGAAGGAACTCAAGCAAATCAAACTGAACACAAATGGCGACTCATTCTGGAGCCGGCGCTCGTTGGGTCCATGATGGCTATAAATCTTGGACAAACATCGCTTCAGAATTTTTACTTACGAACTGCTTGCACTGTTGATTTGGGTTATTCATTTAATATATGTGATAAAGGGGTTGGGGAGGAATTTCGTGCTGCTGAGGCAGCTAGCCAAGTGGTAGTTTCAAGCATAAATGTGAGCAGAAGTTTTGTTGGTTCACTTATTGCCACTATAGTGCTATTATTCGTTGGTCCATGGAGCGACTGTAGTGGACGTAGGAAACCTTTACTAATATTACCCCTACTCGGGATGTGTGTGATGACTACTGGAGTGCTATTGATGCTGACCTTTCCTGGTGCTGACACAAAACAAGTCCTCTATATGGTTCAGATACCGATATCAATGGGTGGAAACTTTGGCTTATTGTTAGCCGCGTCATTTAGTTACATTGGAGACCTTTGTCATGCCACCGGTCGTGATGTAACGAGAACAATGGGTACTCACCGCGCGGCGGTTCAATTCGCCCACGTTTTGGGAGCGGTTTGCGGTCCTTTGCTATACCGCCATCTCGGTTTCTACGGTGTGTTTCCACTCGTATTGTTCTTGCAGGTGGCATCCCTTATTTACGTGGTGATAACGGTCAAAGATGTCAATGTCAACAGTGACAACAAAGTTTCAGTTTTCAACTGGAGATTGCCATTGAACGCAATACAATGTTTGACGAGGAAGAGAGATGGCTACAAAAGACTTGTGATACTTTTAATGCTGATTGTTGCTCTTGGAGACAGAATGTTACTGTCAGCGGAGGTGCTACTCTCGTACATGTACTACAGATACAAATTTCAATGGGACGATGTTATGTTTGGATCATTCCTAGCTTACAGGAATACTATTAGTTTCGTGGGTACGTTGTTGATCTTGACGGTGTTGAAGCGTCGTCTCAAGCTATCAGACGAGGCGGTTGGCGCAATGAGCTGCGTATCTTATATGCTAGCCACCTCTTCACTTATAGCTTCCAAAACCACCTTACTTGTGTTTATGATTCCGATCGTAGGCATCATATCACAAGGCTCGCAAGTGGTACAGCGGCCTTTATTGAACAAACAGATCTTGCCAACAGAACAAGGTAAAATATACAGTGTTCTGGGAGCTCTAGAATCTGCGACACAAATGCTGTCGTCTCCATTGTACTCGCTGCTCTATACGAAAACTGTTTCCACTATACCGGACGCGTGGCTCATACCCGGGATCATTCTAGCTATGATACAACTCCTTTCATATCTATACACGAGAAGACTGCAACGATTAACGCCAAATGAAAAAAATATACCCCTACCTGTCATAAAAACAAACGAAAAGGTAGCAGATCTCAAGAGCTGA

Protein sequence:

>DPOGS211576-PA
MTESSIDLPELERLRSTNTDGVEGTQANQTEHKWRLILEPALVGSMMAINLGQTSLQNFYLRTACTVDLGYSFNICDKGVGEEFRAAEAASQVVVSSINVSRSFVGSLIATIVLLFVGPWSDCSGRRKPLLILPLLGMCVMTTGVLLMLTFPGADTKQVLYMVQIPISMGGNFGLLLAASFSYIGDLCHATGRDVTRTMGTHRAAVQFAHVLGAVCGPLLYRHLGFYGVFPLVLFLQVASLIYVVITVKDVNVNSDNKVSVFNWRLPLNAIQCLTRKRDGYKRLVILLMLIVALGDRMLLSAEVLLSYMYYRYKFQWDDVMFGSFLAYRNTISFVGTLLILTVLKRRLKLSDEAVGAMSCVSYMLATSSLIASKTTLLVFMIPIVGIISQGSQVVQRPLLNKQILPTEQGKIYSVLGALESATQMLSSPLYSLLYTKTVSTIPDAWLIPGIILAMIQLLSYLYTRRLQRLTPNEKNIPLPVIKTNEKVADLKS-