Monarch geneset OGS2.0

DPOGS200810
TranscriptDPOGS200810-TA1482 bp
ProteinDPOGS200810-PA493 aa
Genomic positionDPSCF300249 - 17893-23147
RNAseq coverage423x (Rank: top 29%)
Annotation
HeliconiusHMEL0126013e-9840.96% 
BombyxBGIBMGA005389-TA0.079.13% 
DrosophilaCG4330-PA2e-16958.82% 
EBI UniRef50UniRef50_Q9VYG73e-16758.82%CG4330 n=10 Tax=Drosophila RepID=Q9VYG7_DROME
NCBI RefSeqXP_002106820.17e-16960.86%GD17102 [Drosophila simulans]
NCBI nr blastpgi|2700124710.066.05%hypothetical protein TcasGA2_TC006625 [Tribolium castaneum]
NCBI nr blastxgi|2700124710.066.05%hypothetical protein TcasGA2_TC006625 [Tribolium castaneum]
Group
Gene OntologyGO:00550851.2e-52transmembrane transport
GO:00160211.2e-52integral to membrane
KEGG pathwaycin:1001873642e-106 
 K12301 (SLC17A5)maps-> Lysosome
InterPro domain[7-476] IPR0161962.8e-78Major facilitator superfamily domain, general substrate transporter
[77-438] IPR0117011.2e-52Major facilitator superfamily
Orthology groupMCL16941 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200810-TA
ATGATAAATGCTGATATAGAAAATGATGTCACCAGCAGTCACAGGAATTTAGTTGAAACCGAGAGAGTTGAGGAGACAACAGGATGGATCAAGTGCAGAACTGTGTTGGGGATCATGGGATTACTAGGGTTTGCAAATGTTTATGCAATGCGTGTAAATCTATCAGTGGCAATAGTGGCCATGATAAACTCCACAGAGCCATTGCCTTCCAATGACACCACATTGGATGTTTGTCCGACAAGCTTACCCAGTAATAATACTATTCCACCTAAACAAGGGGAATTCAACTGGACGGCAGAACAACAAAGTATAATACTTGGATCATTTTTCTATGGCTATGTTTTAACACAGATTCCGGGAGGACGCATTGCCGAGATGTTTGGGGGAAAATTAGTATATGGAATTGGAGTTTTGTTAACAGCTGTATTTACAATACTCAGTCCAATAGCTGCGTATATAGATTTTAAGTTTTTTATAGTAGTGCGAGTGTTAGAAGGCCTGGGTGAGGGTGTGACATACCCGGCCATGCATGCTATGTTATCAAGATGGATTCCACCATTAGAGAGATCAAAGTTTGCCGCCTATGTTTATGCTGGTTCTAACATTGGTACAGTGATATCACTGCCAATATCTGGTTGGCTGTGTACTTTAGACTTTGCTGGTGGTTGGCCTTTGTGCTTCTACATATTTGGAGGTCTGGGGATCATATGGTTTATTGCCTGGATGTTCCTTATATATGATACACCTCAAAAGCATCCGAGAATATGTCCGAAAGAGGTGGAATTTATCACAGAAAGTATCGGTGTTCAGGAGGAGCATAGACAATCGATTCCTTGGTGCAAGTTTTTAACGTGTCTGCCGTTGTGGGCTATATTGATAGCACAGTGTGGACAATCATGGCTCTTCTACACTCAGCTGACAGAACTGCCAACTTACATGAACAATATACTACACTTTGACATCGTATCTAACGCACGTCTTCTGGCGTTGCCATATTTATCGTCATGGGTGGCTGGTATAGGTATCAGTATATTTGCGGACTGGTTGCTAGCAAAGGGCTGGATATCGAGACTGAACAGTATGAAGCTCTGGAACACTGTCGGCTCGTTCATCCCAGCCCTGGGCTTGCTGGGTATCGCGTGGGCTGGCTGCGATCGTTTGTCTGTTATGTTGTTACTAACAATAACATCCGCCTTCGGTGGAGCCGTATACGCTGGTAATCAGATGAACCACATCAACCTGTCTCCTCAGTTCGCTGGGACCATGTACGGTATCACAAACGCAGCCAGCAACATTTGCGGGTTCATGGCGCCCTACGTTATAGGACTCATAATTAGTGACACACAGCAAACATTAGGACAATGGCGCGAAGTGTTCTACTTAGCGGCTGCCATTGATCTGGGAGCGAATCTGTTTTATTTGTTCTTTGCTAGTACTGAAGAACAGTATACTATATGTGAGACATTACATGTAAATACTTAG

Protein sequence:

>DPOGS200810-PA
MINADIENDVTSSHRNLVETERVEETTGWIKCRTVLGIMGLLGFANVYAMRVNLSVAIVAMINSTEPLPSNDTTLDVCPTSLPSNNTIPPKQGEFNWTAEQQSIILGSFFYGYVLTQIPGGRIAEMFGGKLVYGIGVLLTAVFTILSPIAAYIDFKFFIVVRVLEGLGEGVTYPAMHAMLSRWIPPLERSKFAAYVYAGSNIGTVISLPISGWLCTLDFAGGWPLCFYIFGGLGIIWFIAWMFLIYDTPQKHPRICPKEVEFITESIGVQEEHRQSIPWCKFLTCLPLWAILIAQCGQSWLFYTQLTELPTYMNNILHFDIVSNARLLALPYLSSWVAGIGISIFADWLLAKGWISRLNSMKLWNTVGSFIPALGLLGIAWAGCDRLSVMLLLTITSAFGGAVYAGNQMNHINLSPQFAGTMYGITNAASNICGFMAPYVIGLIISDTQQTLGQWREVFYLAAAIDLGANLFYLFFASTEEQYTICETLHVNT-