Monarch geneset OGS2.0

DPOGS209052
TranscriptDPOGS209052-TA1269 bp
ProteinDPOGS209052-PA422 aa
Genomic positionDPSCF300102 + 89566-92157
RNAseq coverage348x (Rank: top 34%)
Annotation
HeliconiusHMEL0060943e-8361.98% 
BombyxBGIBMGA010038-TA8e-7356.96% 
Drosophilasll-PA4e-9246.58% 
EBI UniRef50UniRef50_D6WTZ86e-9446.04%Slalom n=4 Tax=Tribolium castaneum RepID=D6WTZ8_TRICA
NCBI RefSeqXP_001359557.21e-9347.09%GA20487 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|2700111212e-9346.04%slalom [Tribolium castaneum]
NCBI nr blastxgi|3838635937e-9245.96%PREDICTED: adenosine 3'-phospho 5'-phosphosulfate transporter 1-like [Megachile rotundata]
Group
Gene OntologyGO:00550851e-69transmembrane transport
KEGG pathway 
InterPro domain[117-411] IPR0136571e-69UAA transporter
Orthology groupMCL13920 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209052-TA
ATGAGATCAAAATTCGTGATTGGGACTGTGCTTGGTTTGTTTGTACTTTTTGGCTGGCTACTAGGACATATATATGGGGTGCTCCTAGCTGTCTATGAAGAGACAACAATTTTCAAAGACTTGGAGTACTCATGGGTGTTCAGATTACTATTAAACTTAGTAGGCTACTCCACCGTGATACTGCCATGCTTTGCACTTTACAAATATCTTGAAAAGACACACTACTTCGAAAAAATAACTAACAACAATTGGGTGAGTCGTCTGCTGCGAACGATGTTTTTGGAACAGGAGCGGCTGCCGGAGGTGGTGCGTGTGGACGAGAGCCTCCCTCATGAAAGTGTCGAGTTAGCGTTGTGTGTGGTGGGACTCATGGGGGCTTACCTCGTGTGGGGATTGCTACAGGAGAAGATAATGACAACTGACTACGTGTTGTCGGACGGCTCCCTGTGCCGCTTCACGGACTCTCAGTTCCTGGTGTTCGTGAACCGCGTGCTGGGGTCGTTGGTGGCACTGGTGCGTCTCCGCGCCACGCGGCGGCCTCTGTTCCCCGCTCCCCTCTACAAGTTCTCGTACTGCGCGCTCACCAACATCGTCAGCGCCTGGTGTCAGTACGAGGCCCTCAAGTTCGTCAGCTTTCCCACTCAAGTGCTGTCCAAGTCGTGTAAGGTGATCCCGGTGATGCTGATGGGGAAGCTGATCTCCCGCGCCAAGTACGAGTCCTACGAGTACGTCACCGCCGTCCTCATCTCGCTCGGCATGGCGCTGTTCCTGTTCGGGACCGGCGAGGACCACGCGTGGGGCGCGCCGAGCGTGTCCGGGGCGTGCCTGCTGGTGCTGTACCTGTGCTGCGACAGCTTCACGTCGTCGTGGCAGGGCGCGCTGTTCCGGCGGCACGGCCTGCAGCCGCTGCAGATGCTGCTGTGTGTGAGCTTGTGCTCGTGCTCGCTGTCGGCGGCGGCGCTGCTCGGGCGGCCGCTGCCGGCGCTCATCTCCCAGCCGTCGTTCGTGGCGGACGCGTGCCTGCTGGCGCTGAGCTCGGCCGCGGGTCAGCTGATCATCTACCGCACCATCGCTCGGTTCGGTCCCGTGGTGTTCGCTATCTGCATGACGCTGCGGCAGGCGGGCTCGGTGCTGTTGTCGTGCCTGGTGTTCGGCCACCGCGTGTCGGCGGGCGGCGCGGCGGGGGTGACGCTGGTGTTCTCGTCGGTGTTCCTCCGCCTGTACTGGCGCAGACGCCGCGCGCCGCTGGCCGCGCCGGGCGACAAGTAG

Protein sequence:

>DPOGS209052-PA
MRSKFVIGTVLGLFVLFGWLLGHIYGVLLAVYEETTIFKDLEYSWVFRLLLNLVGYSTVILPCFALYKYLEKTHYFEKITNNNWVSRLLRTMFLEQERLPEVVRVDESLPHESVELALCVVGLMGAYLVWGLLQEKIMTTDYVLSDGSLCRFTDSQFLVFVNRVLGSLVALVRLRATRRPLFPAPLYKFSYCALTNIVSAWCQYEALKFVSFPTQVLSKSCKVIPVMLMGKLISRAKYESYEYVTAVLISLGMALFLFGTGEDHAWGAPSVSGACLLVLYLCCDSFTSSWQGALFRRHGLQPLQMLLCVSLCSCSLSAAALLGRPLPALISQPSFVADACLLALSSAAGQLIIYRTIARFGPVVFAICMTLRQAGSVLLSCLVFGHRVSAGGAAGVTLVFSSVFLRLYWRRRRAPLAAPGDK-