Monarch geneset OGS2.0

DPOGS202411
TranscriptDPOGS202411-TA1719 bp
ProteinDPOGS202411-PA572 aa
Genomic positionDPSCF300233 + 3652-15279
RNAseq coverage223x (Rank: top 45%)
Annotation
HeliconiusHMEL0036910.069.57% 
BombyxBGIBMGA003437-TA0.063.00% 
DrosophilaCG42269-PE8e-14645.03% 
EBI UniRef50UniRef50_Q9VRT41e-14345.03%CG42269, isoform E n=25 Tax=Neoptera RepID=Q9VRT4_DROME
NCBI RefSeqNP_648019.22e-14445.03%CG42269, isoform E [Drosophila melanogaster]
NCBI nr blastpgi|2213309074e-14345.03%CG42269, isoform E [Drosophila melanogaster]
NCBI nr blastxgi|2213309076e-14946.08%CG42269, isoform E [Drosophila melanogaster]
Group
Gene OntologyGO:00550859.4e-29transmembrane transport
GO:00160219.4e-29integral to membrane
GO:00228579.4e-29transmembrane transporter activity
KEGG pathway 
InterPro domain[164-543] IPR0161962.3e-46Major facilitator superfamily domain, general substrate transporter
[149-534] IPR0058289.4e-29General substrate transporter
Orthology groupMCL24901 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202411-TA
ATGATATTAAACGATTTAAATGAAGACCAACACACGAAGGATAAGAAAGAAGATGAGAAAATTAAAGGCAATGACAGCGATGCGTTGGAATCTGTGCTGGTACACATAGGTCAGTTCGGCCGCTACCAGAAGCTGTTGTTCCTGGGGATGTTACCAGTGGGAGTGTTTTTCGCATTCATATATTTCGTCCAAATGTTCATAGCCGCTACCCCACAACGGCACTGGTGTAAAGTTCCAGAACTAACCCATTTAGATCCCGAGTTGAGGCGAAATTTGACTGCTCCTCCACCAAGTATTGACGGCGAGGAATGGAACCGCTGCTTCATGTACGACGCCAACTGGACCGAGGTTTTGCTCAGCAACAGAACCGATCCCGACACACCCATCACAGCCTGCAAGAACGGCTGGGAGTTCGAACTGAAAGACATCCCATACCATACTGTAGTTAGTGAGCGCGGCTGGGTGTGTGATAATTCCGGCTACACTCCCTTCACTCAGACTGTTTTCTTCATTGGCTCCTTCTTTGGAGGTTTATTTTTTGGATGGGTATCTGACTACTTTGGAAGGGTACCGGCTTTGTTTGGTGCCAACGTAATGGCGTTGGTCGGCGGCATAGCAACAATATACACGACGGAGATATGGGACTTCGCTTTCTGTCGGTTCATCGTCGGAACTTCCTACGACACCTCTTTTATGGCCATGTATATTTTAGTGTTAGAGTACACGGGTCCCAAGTACAGGACGTGGGCAGCGAACATGTCGATAGCTCTGTTCTTCGGTGGGGGCTGCCTCATCCTTCCCTGGGTGGCGTGGTGGGTCTCGGATTGGAGGACACTCCTCTGGGTCACTTCACTGCCCATGCTGCTAGTGGTAGTTGTGCCTTTCACTGTACCGGAGAGCGCTAGATGGCTTTCATCTCGGGGGCGGGTTAACGATGCTGTAAAAGTTCTGAGAAGATTCGAAAGAGTCAATGGAAGGAAAATACCTGATGACGTCATGGACGAGTTTATTGTCTCAGCCAGTCAGACTCGCCGCACAAAAGAGTCTATAATGGATGTCTTCAAGAGTTCAGCTCTCCGAGGTCTCCTCGCTCGGATGGTTATAGTGTACATGACATGCGCGCTGGTCTTCGACGGGCTGGTCCGTATGTCGGAAGGCCTCGGGTTGGACTTTTTCGTGACCTTCACTTTGACTTCCGCCACCGAGATACCGTCAGTCATGCTGCTGGCCTTAGTACTGGACAGGTTCGGTCGTCGGGTTCTGACCGCCGTCCCGCTGTTGATATCAGGGACGCTGATACTCATTGCAACTTTGGTACCTAGAGGTGTACCGCAAGTGTCGCTGGCGATCATGGCCAGGTTCATGATCAACATGGCGTACAACGCGGCCATCCAGTGGTCCGCTGAGCTGATGCCGACTCCCGTGCGAGGATCGGCCTCCTCCTTCATACACGTCATGGGATACGTCTCCACGCTCGTGTCCCCCTTCGTCGTGTACTCGGAACGAGCGTGGAAGTTATTGCCTCTCCTCATTCTGGGGGCGCTGTGTTTGGTGGCGAGCGGAGTTTCTCTCATGGTGCCGGAGACCAACGGCCGGCCGATGCCACAGACTATAGAAGAGGGCGAACAGGTCGTGAGGAGCTACACGCTATGCGGGAAAGTCGAAGACCCAGAAGTAACTGCTGAACAAAACGAGAAAGAGAAGGCGCTCATCACTTAG

Protein sequence:

>DPOGS202411-PA
MILNDLNEDQHTKDKKEDEKIKGNDSDALESVLVHIGQFGRYQKLLFLGMLPVGVFFAFIYFVQMFIAATPQRHWCKVPELTHLDPELRRNLTAPPPSIDGEEWNRCFMYDANWTEVLLSNRTDPDTPITACKNGWEFELKDIPYHTVVSERGWVCDNSGYTPFTQTVFFIGSFFGGLFFGWVSDYFGRVPALFGANVMALVGGIATIYTTEIWDFAFCRFIVGTSYDTSFMAMYILVLEYTGPKYRTWAANMSIALFFGGGCLILPWVAWWVSDWRTLLWVTSLPMLLVVVVPFTVPESARWLSSRGRVNDAVKVLRRFERVNGRKIPDDVMDEFIVSASQTRRTKESIMDVFKSSALRGLLARMVIVYMTCALVFDGLVRMSEGLGLDFFVTFTLTSATEIPSVMLLALVLDRFGRRVLTAVPLLISGTLILIATLVPRGVPQVSLAIMARFMINMAYNAAIQWSAELMPTPVRGSASSFIHVMGYVSTLVSPFVVYSERAWKLLPLLILGALCLVASGVSLMVPETNGRPMPQTIEEGEQVVRSYTLCGKVEDPEVTAEQNEKEKALIT-