Monarch geneset OGS2.0

DPOGS216010
TranscriptDPOGS216010-TA1305 bp
ProteinDPOGS216010-PA434 aa
Genomic positionDPSCF300078 + 516043-519363
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0048150.083.93% 
BombyxBGIBMGA000939-TA2e-13273.62% 
DrosophilaCG14689-PA2e-3230.00% 
EBI UniRef50UniRef50_D6WC588e-6237.61%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WC58_TRICA
NCBI RefSeqXP_972004.11e-6237.61%PREDICTED: similar to CG14689 CG14689-PA [Tribolium castaneum]
NCBI nr blastpgi|910780783e-6137.61%PREDICTED: similar to CG14689 CG14689-PA [Tribolium castaneum]
NCBI nr blastxgi|910780781e-5837.61%PREDICTED: similar to CG14689 CG14689-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL11818 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216010-TA
ATGGCTGATGAAGAAGATGAGGGTGGCTTTGGAGGTTTATATGAAGATGGCCATTGGGTTTGGGACGAAGAAACAGAAGGTTTAGTTTTCGTGAGCGATTTGCCACCGAAACAGGTAGAGATTATTCAAATGCCAACTAAAGCACCCACAGGGACAATAGACTTTAGAGATGACGTTGATTTAATCGAACAGATAAGATTTCGTCGGCGCTATCAAAGAAAATTAAAACCTGGACAAGCAGACGTCATTACCGTACAGGATGTGAAGGACATAGCTTTGTTTACAGCACCAGTAAATATTTTAAGTCCCGTGTTAATAAACATGCTGCATCTACCTACTACAGAGAGATTTATAAGGGCGCTAGTGCTTTGCTGTCAGTATTATTTACAGATTGCCAATGAAATGACAGAACGTATAATAAAACTGGAGAGTAAAGTAAGGACACCGCAATGTGAGATTGTCGAAAGCGAATTTCGTGACAACTTGTCCGATTTAAGGCTTCTGGTCGCAAAGGAATATAGCACGATGCTTATTGATACTAAAAAGTTTCATCACATGGGCCCACAAAAAAAACGACGATCTCTATCGGATAAAGATGCAAGATTTTTCGAAACATTGCTACGTATGTGCGTTCAAATTGTGTGGCTGGCCTTAGGGCGAAAGGCTTTCAATCAGATAGAATTGGAAGTGAATAGAATGTTCAAATCGGAGATATTTAACGCTGTGGAGCATACACTGAAGACGGGGTATATATCAAAAATGGTGAAAGAAGAACGCCAAGTTCTGTTGGGGAAGTGCGTCCGTCAAGATAAGAAGCTAAACACGCGTTCGCCGCTCATGAATGAAGTGTTCTGCCATCGCGAAATCGACTACCGTATGATGGGACTCGGTGTTATTAAATGTCAACAATTGAATCCTCGTATATTATATATGCTACAAGCTGTGGCCGGTCCAGAGGAGAATTTGTCAGAATTGGGTATAGTTCTCGGCATCATCGGGATGCCTCGAAATGTTTTCGATACAATGCTGCGACCCATACCGACGGGTGGCGATACCTCCAAATCGAGAGCTTCTGTGTCTACTCAAAGCTATGGTTCAAACAAGAGTGCTGTTAGTGCAAGTTCAAGAAAAAGTCAGCTGTTTATGCCAACTCTCTATCCAAATATTGTACTTCCAGCGAAGGAATCCGAATTAACGTATTTTCCTTCACAATTCTCCGACGAAACTGAGGTTATTCGACCCTGCAGTGAAACTCAAAGACGGCGTTGGCAGAACAGACTACAGAGACTTTTACATCCACACTAA

Protein sequence:

>DPOGS216010-PA
MADEEDEGGFGGLYEDGHWVWDEETEGLVFVSDLPPKQVEIIQMPTKAPTGTIDFRDDVDLIEQIRFRRRYQRKLKPGQADVITVQDVKDIALFTAPVNILSPVLINMLHLPTTERFIRALVLCCQYYLQIANEMTERIIKLESKVRTPQCEIVESEFRDNLSDLRLLVAKEYSTMLIDTKKFHHMGPQKKRRSLSDKDARFFETLLRMCVQIVWLALGRKAFNQIELEVNRMFKSEIFNAVEHTLKTGYISKMVKEERQVLLGKCVRQDKKLNTRSPLMNEVFCHREIDYRMMGLGVIKCQQLNPRILYMLQAVAGPEENLSELGIVLGIIGMPRNVFDTMLRPIPTGGDTSKSRASVSTQSYGSNKSAVSASSRKSQLFMPTLYPNIVLPAKESELTYFPSQFSDETEVIRPCSETQRRRWQNRLQRLLHPH-