Monarch geneset OGS2.0

DPOGS208340
TranscriptDPOGS208340-TA1374 bp
ProteinDPOGS208340-PA457 aa
Genomic positionDPSCF300383 + 65229-70756
RNAseq coverage332x (Rank: top 35%)
Annotation
HeliconiusHMEL0139833e-11866.56% 
BombyxBGIBMGA004077-TA4e-10975.00% 
DrosophilaCG8195-PA5e-5630.75% 
EBI UniRef50UniRef50_Q7K3M97e-5430.75%CG8195 n=28 Tax=Coelomata RepID=Q7K3M9_DROME
NCBI RefSeqXP_001601634.13e-6838.34%PREDICTED: similar to ENSANGP00000014977, partial [Nasonia vitripennis]
NCBI nr blastpgi|3454926852e-6738.24%PREDICTED: solute carrier family 35 member F5-like [Nasonia vitripennis]
NCBI nr blastxgi|3838637034e-6738.87%PREDICTED: solute carrier family 35 member F5-like [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL12123 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208340-TA
ATGTTGACTTTGTGCCATAATTATTTTCGTGTTTCGCCAAATAACAAAAACAATAATGATGGTGGGCAAGACTTTAGTAAACGAAAATTTGCGTTTGGAATATTTATTTTATTCCTTGTATGTGTATTGTGGGTGGTATCATCAGAGTTGACAAAGTACATATATATAGAGAAGAAGTTAGAAAGACCATTTCTTGTGACTTATGTTAGGAGTTCATTATTATTTGTCTATCTATTGGTATTATGTTTTACTCCACCAACAAAAGATCGATGCCAGCCTGCTGATTACACGCAACTTTTAGAACAGACAGAAGAAACAGGCGATGACAGCTTTTACACAGAAGGCAGTAGTAGCCTGGGTGACTCTACATTCGTTCCTGTGAGAGCCGGCGAAACAACTGACAGCGATGACCAATCTCCCAGGGCTGTGAGATTCAATAAAGTAGCCGAGGTGCGTGTTATGTCAGCGGCGATGGCTGGAGAGGCTCTGCTGGCTCGCCTGTCATGGTCGGCGTCAATCCGCGCCACAGAGTACGCACAACGTAGAGCAGCAGCGGCCGCTACAAGGAGACATCTCAGGCTGGCTTTAGTATTCTGTGTACCGTGGTTCATTTCCAACTACCTGTACCGGCTAAGTTTGTTGCATACAACGTCATGCTCAACTACCCTCTACACTTCAACAACCGGAGCATTTGCCCTATCATTCGGAGCAATGTTCTCTCAAGTGCCCACTGATAGATTTTCGACATCCAAATGTATATCTGTCCTGCTCACAGCAGCATGTCTGGTGGTATTAGGCATATCAGAGAGCCATCACAATAACTGGATGGCTGTTTTATCAGCTATTGGATCAGCGTTCTGTTATGCTTTACACCTCTTACTGTTCCGACAGGAATTAAGGAAGGGTGATGGCATTAATTCATTCCTATTAGTTGGTGTTGTTGGTTGTGCATGTGGCTGTCTTGCTTGGATAGCTGGCGGTCTATTGGCTGCTGGTGGTGTTGAACGAGCTGACCTTCCATCACCACGTCTATGGAGCTGGTTGTTATTGGACGCAGCTCTTGGGCCACTACTCATTGAATCATTGTGGTTGTGGGGTAGATGGTTGACGTCATCTCAGACTGCCACGGCATCCTTGGCGTGTATATTTCCATTGTGCGTCTGTGTGGAGGCGTGGCGAGGGACGGGCAGTGTGTGGCGCATAGCGGCGGCGGTACTCGCGGCTGGTGCATGGGTTTTGGCCACTCTGCCTGTACACAAACCACTCGGTTGGATCACGGCTAGGATACGAACGGATGGATTCACGGCTATACGTAATATATCAGATTTGGACGAACAATCAGAAGCTCTCATGTCATCCGATGAACCGACGTAA

Protein sequence:

>DPOGS208340-PA
MLTLCHNYFRVSPNNKNNNDGGQDFSKRKFAFGIFILFLVCVLWVVSSELTKYIYIEKKLERPFLVTYVRSSLLFVYLLVLCFTPPTKDRCQPADYTQLLEQTEETGDDSFYTEGSSSLGDSTFVPVRAGETTDSDDQSPRAVRFNKVAEVRVMSAAMAGEALLARLSWSASIRATEYAQRRAAAAATRRHLRLALVFCVPWFISNYLYRLSLLHTTSCSTTLYTSTTGAFALSFGAMFSQVPTDRFSTSKCISVLLTAACLVVLGISESHHNNWMAVLSAIGSAFCYALHLLLFRQELRKGDGINSFLLVGVVGCACGCLAWIAGGLLAAGGVERADLPSPRLWSWLLLDAALGPLLIESLWLWGRWLTSSQTATASLACIFPLCVCVEAWRGTGSVWRIAAAVLAAGAWVLATLPVHKPLGWITARIRTDGFTAIRNISDLDEQSEALMSSDEPT-