Monarch geneset OGS2.0

DPOGS209511
Transcript: DPOGS209511-TA, 1734 bp
Protein: DPOGS209511-PA, 577 aa
Genomic position: DPSCF300127 + 93731-120166
RNAseq coverage: 703x (Rank: top 18%)
Annotation
Heliconius: HMEL016020, 2e-83, 84.00%
Bombyx: BGIBMGA005389-TA, 7e-85, 35.37%
Drosophila: CG3036-PA, 2e-159, 58.23%
EBI UniRef50: UniRef50_E2B2D2, 4e-162, 57.25%, Sialin n=9 Tax=Endopterygota RepID=E2B2D2_HARSA
NCBI RefSeq: XP_001844052.1, 2e-166, 60.63%, sodium-dependent phosphate transporter [Culex quinquefasciatus]
NCBI nr blastp: gi|170032365, 3e-165, 60.63%, sodium-dependent phosphate transporter [Culex quinquefasciatus]
NCBI nr blastx: gi|195437382, 6e-165, 59.78%, GK24471 [Drosophila willistoni]
Group
Gene Ontology: GO:0055085, 1.2e-49, transmembrane transport
Gene Ontology: GO:0016021, 1.2e-49, integral to membrane
KEGG pathway 
InterPro domain: [117-564] IPR016196, 2e-73, Major facilitator superfamily domain, general substrate transporter
InterPro domain: [120-514] IPR011701, 1.2e-49, Major facilitator superfamily
Orthology group: MCL17065, Insect specific

Nucleotide sequence:

>DPOGS209511-TA
ATGGCTAAGACAGAGAAGTCGTTCCCAAGAAACTTTACAGGTTGCCTAACATGCGTCCAAGTACTGAACATAATGGTACTTCTGGGCTTCATGTTGAACTACGCGTTGAGAGTCAACCTGACCATCGCCATCGTAGAAATGATCTATGACAACAAGACGGATGCACTCGCCATGGTTACCAACATTACGGACACCAATACCACAAACGTCACACAGCACCCGGAACACATCGAGCAAACCCGCTACCACTGGGACACGAAACAGAAAAACCACCTCCTGGGCTCCTTCTTCTGGGGCTACGTACTCACAGAAGTACCAGGTTGCCTAACATGCGTCCAAGTACTGAACATAATGGTACTTCTGGGCTTCATGTTGAACTACGCGTTGAGAGTCAACCTGACCATCGCCATCGTAGAAATGATCTATGACAACAAGACGGATGCACTCGCCATGGTTACCAACATTACGGACACCAATACCACAAACGTCACACAGCACCCGGAACACATCGAGCAAACCCGCTACCACTGGGACACGAAACAGAAAAACCACCTCCTGGGCTCCTTCTTCTGGGGCTACGTACTCACAGAAGTACCAGGTGGCCGTCTAGCTGAAGTTATAGGCGCGCGACGGGTCTTCGGTTATAGCACTCTGTTAGCTAGTATCCTAACCCTGCTGTCGCCCGCCGCCGCCTCAGCTGGTTTCGGTTGGATCGTGGCTCTGAGAGTCCTTCTCGGATTCTTCCTGGGAGCCACATGGCCAGCCATACTGCCCATGGCGTCCAAATGGATTCCGCCTATGGACAGATCGAAATTCATGTCGAACATGATGGCGTCGTCCCTGGGCGCGGCCATCACGATGCCCATCTGCGGTTTCCTTATCGCGCACTTCGGATGGGAGTCAGCGTTTTATTTCACTGGTATAATCGGTGTGATGTGGTCCATGGCGTGGTTCGCCGTGGTCTACGACTCGCCCCACCAACACCCTCGCATCACCGACGCCGAGAGGAACGCGCTCATGAAGGCGCTGCCTCAGGACACCAGCCAGCCCGTAAACCAGCCAGTTCCGTGGAGGTCGCTGCTGACGTCACCTCCGGTGTGGGCCATCGTGGTCACGCACGGCGCCTCGGTGTTCGGATACTTCACCGTGGTCAACCAGCTGCCCACGTACATAGAGTCCATACTGCACTACGACATCAAGCACAACGGCCTGCTGTCATCCCTGCCGTACCTGGGCAAGTACCTGTGCGCCCTAGCCTCGTCCGTGCTGGCGGACTCGTTGAGACGAAGTGGAAAGCTCTCTACTACTGCAGCTAGGAAGCTCTTCACTGGATTCGCTGTGGGTCTACCTGGCGTGATGATGGTGATGCAGGCGTACTTCGGCTACGAGCGCGTGTGGTCGATCGCCATATTCACAGCTGCCCTTACTATCAACGGCGCTGTGACGGCTGGTTACCTCGGGAATGGACTTGATATCGCGCCCAAATTTAGCGGCACTATCTTCGGTATAGCTAACACTCTATCTTCATTTGGTGGCTGGCTCTCCACTTTCATGGTCGGAGAACTCACTAAGCACGAGAACACGTACGAGCAATGGCAGATAGTGTTCTTTATCCTGGCGGGGACGTACGCGTGCGGGGCGCTGGTGTTCCTCATGTTCGGCTCCGGGGAGCTGCAGCCCTGGAGCAAGCCGGTCGTTAAAGAACTCACTCCATTGAGAGAGCAACAGGCTTAG

Protein sequence:

>DPOGS209511-PA
MAKTEKSFPRNFTGCLTCVQVLNIMVLLGFMLNYALRVNLTIAIVEMIYDNKTDALAMVTNITDTNTTNVTQHPEHIEQTRYHWDTKQKNHLLGSFFWGYVLTEVPGCLTCVQVLNIMVLLGFMLNYALRVNLTIAIVEMIYDNKTDALAMVTNITDTNTTNVTQHPEHIEQTRYHWDTKQKNHLLGSFFWGYVLTEVPGGRLAEVIGARRVFGYSTLLASILTLLSPAAASAGFGWIVALRVLLGFFLGATWPAILPMASKWIPPMDRSKFMSNMMASSLGAAITMPICGFLIAHFGWESAFYFTGIIGVMWSMAWFAVVYDSPHQHPRITDAERNALMKALPQDTSQPVNQPVPWRSLLTSPPVWAIVVTHGASVFGYFTVVNQLPTYIESILHYDIKHNGLLSSLPYLGKYLCALASSVLADSLRRSGKLSTTAARKLFTGFAVGLPGVMMVMQAYFGYERVWSIAIFTAALTINGAVTAGYLGNGLDIAPKFSGTIFGIANTLSSFGGWLSTFMVGELTKHENTYEQWQIVFFILAGTYACGALVFLMFGSGELQPWSKPVVKELTPLREQQA-