Monarch geneset OGS2.0

DPOGS203615
Transcript: DPOGS203615-TA, 1293 bp
Protein: DPOGS203615-PA, 430 aa
Genomic position: DPSCF300063 + 170309-173480
RNAseq coverage: 10x (Rank: top 84%)
Annotation
Heliconius: HMEL022404, 1e-118, 52.68%
Bombyx: BGIBMGA007281-TA, 2e-98, 58.16%
Drosophila: l(2)01810-PA, 4e-64, 33.99%
EBI UniRef50: UniRef50_B2DBK3, 6e-106, 64.94%, Similar to CG5304-PA n=3 Tax=Papilionoidea RepID=B2DBK3_9NEOP
NCBI RefSeq: XP_968382.1, 2e-83, 37.96%, PREDICTED: similar to sodium-dependent phosphate transporter [Tribolium castaneum]
NCBI nr blastp: gi|183979298, 2e-105, 64.94%, similar to CG5304-PA [Papilio xuthus]
NCBI nr blastx: gi|183979298, 9e-140, 50.72%, similar to CG5304-PA [Papilio xuthus]
Group
Gene Ontology: GO:0055085, 1.1e-12, transmembrane transport
               GO:0016021, 1.1e-12, integral to membrane
KEGG pathway 
InterPro domain: [1-386] IPR016196, 4.3e-31, Major facilitator superfamily domain, general substrate transporter
                 [36-158] IPR011701, 1.1e-12, Major facilitator superfamily
Orthology group: MCL23655, Lepidoptera specific
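
The lengths and coordinates in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: `parse_position` is a hypothetical helper, and it assumes the genomic coordinates are 1-based and inclusive and that the 1293 bp transcript length includes the stop codon (the nucleotide sequence ends in TAG). Under those assumptions the genomic span is longer than the transcript, which would be consistent with introns, though the record itself does not list the exon structure.

```python
import re

def parse_position(text):
    """Split a 'SCAFFOLD STRAND START-END' string into its four parts.

    Hypothetical helper; the field layout is inferred from this record.
    """
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

# Values copied from the record.
scaffold, strand, start, end = parse_position("DPSCF300063 + 170309-173480")
transcript_bp = 1293
protein_aa = 430

# Assumption: coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# Assumption: the transcript length counts the terminal stop codon.
codons = transcript_bp // 3

print(scaffold, strand, genomic_span)   # scaffold, strand, span in bp
print(transcript_bp % 3 == 0)           # coding length is a whole number of codons
print(codons - 1 == protein_aa)         # codons minus the stop equals protein length
```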
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203615-TA
ATGGAAAGTGATAAAGGCATTCTTAGCACTTTGCAAGCATGCTGCATTATCCCTAAAAGATATATATTTTCAATAATGGCAATGTTCGGAATTTTGAATGTGTTCACGATGCGAGTAAGCCTCAACATAGCGATAACACAAATGGTCAGACATGTCAAGACCGTCGGGGGGCACTTTGATCCTGATGCGTGCCCCAGTGACGATGTAGAAGGAAATAGAACTATCGTTTTAAATCCACATGCCGTATTTGATTGGGACGAGCGAACACAGGGCCACCTTTTAAGTGGTTTTTATTATGGATATGCCACAACGCAGTTTTTGGGAGGATATTTAGTGGAACGCTACGGTGGAAAATGGACAATTGGTCTCGGTCTTTTAAGCACATCAATATTTACTTTGCTCACACCGGTAGTTTTAAAAGTAGGCGGAGAAACTTGGTTATTTATTTTGCGAGTCCTCCAGGGTATGGGTGAGATAATACTATGCTATAGCGAGCCAAACACTCATCCATTTATTTCAAAAGCGGAGTTAAACCATTTGAATAAAGTTGTAATAAGATCTGAACATAAACATTGTAAAGACACCGTACCTTGGAAAGCAATTTTACGGTCACCGCCAACATGGGCTCTTTTAGTTTCACATGTTGGTCATGACTGGGGTTTGTATACGATGATAACAGATCTTCCAAAATACTCATATGATGTACTAAAGTTTAATATAACTGACACTGGACTACTTTCTGGATTACCTTATGCGGCAATGACTCTTTGTTCATTTCTGTTCGGATATATTAGTGACCTATGTATTGAAAAAGCCGCAGCTGGTCCAGCTATTTGTATAATACTGGCTTCGTATTCAGGATGTGATAGAAACGCAGCGATGATTTATTTTATAATATCTATGGGACTTATGGGTGCTTACTACAGTGGAATGAAGATTAACACGCTGGATATCGCACCAAACTTTGCTGGGAGTTTAACATCATTGATCAATACATCATCAACATTCACTGGCATAATATCACCATTTCTAATAGGTCTCTTGACTCCAGATTCAACGTTAGTCCAGTGGAGAACTGCTTTTTGGGTGTGCTTTGCTATGTTAATGAGTACCAATTTGATTTATTGCTTATTCACGGAAACGGAACAGCAGTGGTGGGATGATGTAAGGAAGTTCGGATATCCGGAAGACTGGAAGCATGGACCCATAGTCAAGGATGAAATCAAAAATTCGGAAGAAGAGCAGTTGAATAAATATAGGGATAAGGAACTTGATAGCATTATATCTAAATAG

Protein sequence:

>DPOGS203615-PA
MESDKGILSTLQACCIIPKRYIFSIMAMFGILNVFTMRVSLNIAITQMVRHVKTVGGHFDPDACPSDDVEGNRTIVLNPHAVFDWDERTQGHLLSGFYYGYATTQFLGGYLVERYGGKWTIGLGLLSTSIFTLLTPVVLKVGGETWLFILRVLQGMGEIILCYSEPNTHPFISKAELNHLNKVVIRSEHKHCKDTVPWKAILRSPPTWALLVSHVGHDWGLYTMITDLPKYSYDVLKFNITDTGLLSGLPYAAMTLCSFLFGYISDLCIEKAAAGPAICIILASYSGCDRNAAMIYFIISMGLMGAYYSGMKINTLDIAPNFAGSLTSLINTSSTFTGIISPFLIGLLTPDSTLVQWRTAFWVCFAMLMSTNLIYCLFTETEQQWWDDVRKFGYPEDWKHGPIVKDEIKNSEEEQLNKYRDKELDSIISK-
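
The protein sequence corresponds to a codon-by-codon translation of the transcript. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and writing a stop codon as "-", as this record does. The function name `translate` and the short test input are illustrative: the input is the first six codons of DPOGS203615-TA followed by its terminal TAG.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stops become stop_symbol."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)

# First six codons of the transcript plus its final stop codon.
print(translate("ATGGAAAGTGATAAAGGCTAG"))  # MESDKG-
```

Passing the full DPOGS203615-TA sequence to the same function should reproduce DPOGS203615-PA if the standard code applies throughout.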