Monarch geneset OGS2.0

DPOGS202990
Transcript: DPOGS202990-TA; 1098 bp
Protein: DPOGS202990-PA; 365 aa
Genomic position: DPSCF300068 - 312136-314827
RNAseq coverage: 1417x (Rank: top 9%)
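The transcript and protein lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the 1098 bp transcript is coding sequence only and ends with a stop codon (the sequence in this record starts with ATG and ends with TAA); the function name `expected_protein_length` is hypothetical and introduced only for illustration.

```python
# Lengths taken from the Transcript and Protein rows of this record.
transcript_bp = 1098
protein_aa = 365


def expected_protein_length(cds_bp: int, has_stop: bool = True) -> int:
    """Return the protein length implied by a coding sequence length.

    Assumes the sequence is entirely coding and in frame, which is an
    assumption about this record rather than something it states.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if has_stop else codons


implied = expected_protein_length(transcript_bp)
print(implied, implied == protein_aa)
```

Under these assumptions, 1098 bp corresponds to 366 codons, that is, 365 amino acids plus one stop codon, which agrees with the listed protein length.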
Annotation
Heliconius: HMEL011031; 5e-160; 75.20%
Bombyx: BGIBMGA012335-TA; 5e-148; 68.60%
Drosophila: Tango9-PA; 3e-55; 33.05%
EBI UniRef50: UniRef50_B7QC12; 2e-105; 56.20%; Transmembrane protein C2orf18, putative n=4 Tax=Arthropoda RepID=B7QC12_IXOSC
NCBI RefSeq: XP_001656408.1; 1e-117; 58.09%; hypothetical protein AaeL_AAEL000459 [Aedes aegypti]
NCBI nr blastp: gi|157134719; 2e-116; 58.09%; hypothetical protein AaeL_AAEL000459 [Aedes aegypti]
NCBI nr blastx: gi|157134719; 4e-114; 58.24%; hypothetical protein AaeL_AAEL000459 [Aedes aegypti]
Group
Gene Ontology
GO:0016021; 1.9e-134; integral to membrane
GO:0008643; 2.5e-07; carbohydrate transport
GO:0005351; 2.5e-07; sugar:hydrogen symporter activity
GO:0000139; 2.5e-07; Golgi membrane
KEGG pathway 
InterPro domain
[1-365] IPR012404; 1.9e-134; Uncharacterised conserved protein UCP036436, nucleotide-sugar transporter-related
[92-206] IPR007271; 2.5e-07; Nucleotide-sugar transporter
Orthology group: MCL14909; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202990-TA
ATGGCATGGACCGGATACCAGAAGTTTCTAGCACTAGTTATGGTGGTAACGGGGTCTATTAACACCCTGAGTACCAAGTGGGCAGATAACATCGATTCTAAAGGTTCCGATGGGATAGTTCGCACATTTCAACATCCATTCCTCCAGGCTTTGTTTATGTTTTTCGGTGAAATGATGTGTCTGTGGACTTTCAAATTAGTTTATTGGTGGAACAGAAGGAGCGGCACAGAAAGCCAATTGACTCAGGGCAGCCAAGACTTTAACCCATTCATATTGATGCCCGCAGCTATGTTTGACTTGATTGGAACATCAATCATATATATTGGGCTGACCCTGACCTATGCCAGCAGCTTCCAGATGTTCCGAGGTTCTATTATTGTGTTCGTAGCTCTGTTCTCAACTATATTACTTGATAGAGTTATTAAAAGACGAGAATGGTTTGGAATATCTCAATTAATCTTGGGACTGATAATAATTGGTGCTACCGATGCCATCTATCAGTCCCCCGATGACTCTAAAGGTAGAAATAGTATGATAACTGGGGACTTGTTGATAATCCTGGCCCAAATTATACTCGCCTGCCAAATGGTTTACGAAGAGAAATTTGTATCTGGTCTCAATATTCCACCATTACAAGCTGTCGGCTGGGAGGGTGTTTTTGGATTTTCAATGCTATCGGTTCTACTTGTGATATTCTACTTTATCCCAGCCCCACCGCACTTTGACAACAACGCTAGGCATACCGTCGAGGATTTTATTGATGGACTGGTGCAAATAGGAAACAACTCATTTCTACTGCTCGCTATAATGGGAACTGTAGTTTCCATAGCGTTTTACAACTTTGCTGGTATCAGTGTCACCAAGGAAATGTCTGCCACCACAAGAATGGTCCTAGATTCTGTGAGGACCCTCGTCATCTGGATGGTATCTCTTGGAGTGAAATGGCAGGTTTTCCATTGGCAGCACCTGATAGGTTTTGCTATCCTAATCTTTGGTATGGCCGTCTATTATGACATAATCCCAATGAATTCCAGAAGACCTGCCGACGATGAAACTCCTGTTGTTAACTCGGAAGCTGATAGACTCGAAGAAGCTTAA

Protein sequence:

>DPOGS202990-PA
MAWTGYQKFLALVMVVTGSINTLSTKWADNIDSKGSDGIVRTFQHPFLQALFMFFGEMMCLWTFKLVYWWNRRSGTESQLTQGSQDFNPFILMPAAMFDLIGTSIIYIGLTLTYASSFQMFRGSIIVFVALFSTILLDRVIKRREWFGISQLILGLIIIGATDAIYQSPDDSKGRNSMITGDLLIILAQIILACQMVYEEKFVSGLNIPPLQAVGWEGVFGFSMLSVLLVIFYFIPAPPHFDNNARHTVEDFIDGLVQIGNNSFLLLAIMGTVVSIAFYNFAGISVTKEMSATTRMVLDSVRTLVIWMVSLGVKWQVFHWQHLIGFAILIFGMAVYYDIIPMNSRRPADDETPVVNSEADRLEEA-
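The nucleotide and protein sequences above can be checked against each other by translating the transcript. The following is a minimal sketch, assuming the standard genetic code and that the transcript is in frame from its first base; `translate`, `cds_start`, and `cds_end` are hypothetical names, and only the first 33 and last 12 bases of DPOGS202990-TA are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons ordered over the bases T, C, A, G
# (third base varying fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 33 and last 12 bases of the DPOGS202990-TA sequence in this record.
cds_start = "ATGGCATGGACCGGATACCAGAAGTTTCTAGCA"
cds_end = "GAAGAAGCTTAA"

print(translate(cds_start))  # MAWTGYQKFLA
print(translate(cds_end))    # EEA*
```

The translated prefix matches the start of DPOGS202990-PA, and the final TAA stop codon corresponds to the trailing "-" at the end of the protein sequence, which this sketch writes as "*" instead.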