Monarch geneset OGS2.0

DPOGS201145
TranscriptDPOGS201145-TA1905 bp
ProteinDPOGS201145-PA634 aa
Genomic positionDPSCF300065 - 117247-138197
RNAseq coverage1049x (Rank: top 12%)
Annotation
HeliconiusHMEL0141520.085.77% 
BombyxBGIBMGA003950-TA2e-16588.79% 
DrosophilaCG6293-PA0.064.10% 
EBI UniRef50UniRef50_E2BK870.067.97%Solute carrier family 23 member 1 n=9 Tax=Pancrustacea RepID=E2BK87_HARSA
NCBI RefSeqXP_001606771.10.066.15%PREDICTED: similar to ascorbate transporter [Nasonia vitripennis]
NCBI nr blastpgi|3504117510.070.27%PREDICTED: solute carrier family 23 member 1-like [Bombus impatiens]
NCBI nr blastxgi|3838606460.069.81%PREDICTED: solute carrier family 23 member 1-like [Megachile rotundata]
Group
Gene OntologyGO:00160204.5e-261membrane
GO:00068104.5e-261transport
GO:00550854.5e-261transmembrane transport
GO:00052154.5e-261transporter activity
KEGG pathway 
InterPro domain[72-620] IPR0060434.5e-261Xanthine/uracil/vitamin C permease
Orthology groupMCL12074 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201145-TA
ATGGTGCATCAGAACATCCAAATCGGTCTGGAAAATGTCTTACGGGATTTGTTCAAGGAACTCTTCATAAGTACACCTGTACTGGACGTTTCACTGGCATGTTCGGTAGAACGTGTGGCAATAGTCGAGTACTCTTATAGGTCCCACCTAATGATTTTAACGTGGGATATCTTTAACATTCCCATGGAAGTGCTAACAGCCGACTGCTGGGACTCGAATCCTGAGAATGCGCAGAGCCTGCCAAAGGAGGAGAGAAAAGGGAACGTGACATACGGCATTGACGATGCACCGCCCTGGTATTTATGTATATTCATGGCTCTTCAGCACTACCTGACAATGATCGGTGCCATAGTCGCCATCCCTTTCATCCTGTGTCCGGCGCTGTGCATGCAGGAGACAGACCCTGACAGATCCAACATCATCTCAACGATGATATTCGTCACTGGATTAGTGACCTGGTTCCAAGCGACGTTCGGTTGTCGTCTGCCCATCGTGCAGGGTGGTACTATATCGTTCCTGGTGCCAACCTTGGCCATCCTCGGTCTGCCGACCTGGAAGTGTCCAGACTCCGGAACTCTCTCAGCGATGACGGATGATGAGAGACGCCTCGTGTGGACCACCAGAATGTGCGAATTGTCAGGCGCTATCGCTGTCTCGGCCTTATTCCAAGTCTTTGGAGGCTACTTCGGCATCATCGGCTCCTTACTGCGGTTTGTGACTCCGCTCACTATAGCGCCCACGGTGGCCCTGGTGGGACTTACCTTATTCGACCACGCGGCCGGCGCAGCTTCCCAGCAGTGGGGTATCGCCGCTGGAACTTTCACGTTGCTGACTATATTCTCACAATGCATGAGCGAAGTCCGGATACCGACGCTGACGTGGAAGCGAGCGAGCGGCTTCACTATCATATGGTTTCCACTGTTCAAGCTGTTTCCCGTGTTATTAACTATAGCTATAATGTGGGTGGTGTGCGGCGTGTTGACTGCTACAAACGTCTTCCCAGCGGGTCATCCGGCGAGGACTGACCTCAAGCTTAACATCATAGAGGACGCTCCGTGGTTCAGAGTTCCATATCCCGGTCAATGGGGTGTACCGACCGTGAGTGTTGCGGGTGTGTTGGGTATGCTGGCCGGAGTACTCGCCTGTACCGTGGAGTCCATCAGTTACTATCCCACCACAGCCAGGATGTGTGCGGCGCCCCCTCCCCCCCTGCATGCTATCAACCGCGGTCTTGGTACGGAAGGCCTGGGAACAATGCTGGCTGGTCTGTGGGGCTCCGGGAACGGAACTAACACCTTCGGCGAGAACGTTGGAGCTATTGGCGTTACCAAGGTAGGTTCCCGGCGCGTGGTACAGTGGGCGGCTGGCTTGATGGTGGTGCAGGGTGTCGTGGGTAAGCTCGGAGCCGTCTTCATCATCATACCGCAGCCAATCGTCGGCGGACTCTTCTGTGTCATGTTTGGGATGATCTCCGCCTTCGGTCTATCGGCTCTTCAGTACGTGAACCTGAACAGTTCGCGGAACCTGTACATCATCGGCTTCAGTCTGTTCTTTCCGCTGGTTCTGACTCGCTGGATGTCGGAACACAGCGGCGTCATACAAACTGGTGTGGAGGCTCTTGATGCGGTGCTTCAAGTGCTGCTGTCTACCAGCATACTGGTGGGAGGAGTCGTCGGTTGTTTGTTGGACAACCTGATACCAGGGACGGATGAAGAGAGAGGTCTGGCGGCGTGGGCGAAGGAGATGAGTTTAGAAACCAGCGGGGACTCTTACGGCAACACTTACGACTTTCCCATAGGAATGAGCCTCGTTACACGATTCACATGGACTCAGTACCTGCCGTTCATGCCGACCTACGAAGCCGGCAAGTTTACAGCTCTCTTCAAGAGGAAAAAGGAATCTTAA

Protein sequence:

>DPOGS201145-PA
MVHQNIQIGLENVLRDLFKELFISTPVLDVSLACSVERVAIVEYSYRSHLMILTWDIFNIPMEVLTADCWDSNPENAQSLPKEERKGNVTYGIDDAPPWYLCIFMALQHYLTMIGAIVAIPFILCPALCMQETDPDRSNIISTMIFVTGLVTWFQATFGCRLPIVQGGTISFLVPTLAILGLPTWKCPDSGTLSAMTDDERRLVWTTRMCELSGAIAVSALFQVFGGYFGIIGSLLRFVTPLTIAPTVALVGLTLFDHAAGAASQQWGIAAGTFTLLTIFSQCMSEVRIPTLTWKRASGFTIIWFPLFKLFPVLLTIAIMWVVCGVLTATNVFPAGHPARTDLKLNIIEDAPWFRVPYPGQWGVPTVSVAGVLGMLAGVLACTVESISYYPTTARMCAAPPPPLHAINRGLGTEGLGTMLAGLWGSGNGTNTFGENVGAIGVTKVGSRRVVQWAAGLMVVQGVVGKLGAVFIIIPQPIVGGLFCVMFGMISAFGLSALQYVNLNSSRNLYIIGFSLFFPLVLTRWMSEHSGVIQTGVEALDAVLQVLLSTSILVGGVVGCLLDNLIPGTDEERGLAAWAKEMSLETSGDSYGNTYDFPIGMSLVTRFTWTQYLPFMPTYEAGKFTALFKRKKES-