Monarch geneset OGS2.0

DPOGS206476
Transcript: DPOGS206476-TA, 1527 bp
Protein: DPOGS206476-PA, 508 aa
Genomic position: DPSCF300070 + 344790-353576
RNAseq coverage: 105x (Rank: top 60%)
Annotation
Heliconius: HMEL014000, 3e-102, 62.17%
Bombyx: BGIBMGA005362-TA, 5e-48, 96.70%
Drosophila: CG6982-PB, 4e-69, 64.64%
EBI UniRef50: UniRef50_Q7QFF5, 1e-67, 64.09%, AGAP000456-PA n=5 Tax=Diptera RepID=Q7QFF5_ANOGA
NCBI RefSeq: XP_624071.1, 2e-76, 68.95%, PREDICTED: similar to CG6982-PA [Apis mellifera]
NCBI nr blastp: gi|307194619, 2e-78, 69.90%, Transmembrane protein 47 [Harpegnathos saltator]
NCBI nr blastx: gi|307194619, 3e-78, 69.90%, Transmembrane protein 47 [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain: [322-500] IPR015664, 1.6e-45, P53-induced protein
Orthology group: MCL16366, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206476-TA
ATGAAAACTGACAAGTGCAAAAAGCCGAGTATTGGATTAGTCTACTTGCTTCCGAGTTTTGCACCGTTAGTATTCATCAGTATCGTGATGCAAACGAACGGCAGTCTGTCAGCGGGTTCTTTGTGGCAGCTGTGGCAGCCGGCAGAGCTCGCTCGTCGTATTATTGGAGAGCGACCCTCTCCTGTCCCCCCCACAAGGGAATTATGGCCACCTCAGGAACGTTTAAACATCAGTACCTTGCAACTCGCCTACCAGCAGGAACTGTATCAGAATGAAGAGTTGTCACCAAGACGACGTTGTTCATCGCTAACGTCTCGACCATATGTTTCCACCACTGTACCCATAGACAGCGATGTGCCGGTACGAATGTGTACTGTAGGAACGAATACTGACCCACCTCCTACTATTCTAAGTCGGATTAGGGGCCTGCTAAGGAGAGATAGTCATTTATATACTTACTCGCCGTCGTTACCACTGAAATCAATCAGTCCAGGACCTAAAGTAGAACAATACGAGGACTCTTCGAGTAATCGTTCTGCATATGGAATAAGGAAGGTGATGGCTGGAACTAAATATCGTATAGCACCAAACGCTGACGAAATTGAAAGTCCGAAAATAGTTTACAATACAACAAAACCAGCTGATCGAATGGTTACTACTTTTGGAGCTAAAAAATCCAGTAAATGGTTTAAACTTCCAGCTAGATCCGATTGTTCGCGTCATTTCCGAGCACTTACAAAGGTGTGCGGTCAGCGATCAGAAGAATCATCTCGCCGACAGCCAAACCTGACGCCCTTACAAATAACTCAGTGCGTGGTATCAGATCTCGTCAAGTGTGTGATGTCAGGGCGCGCGGGAGCCGGCGCGGGGGGGCGCTCGCACCCGCGCGCCCTGCCGCCCGTACTGCCGCCGCCGCCGGAGTTCGCTGAACAAGACGACATGCCCGCCAATCAGATAGCACAGGTGATCGCTCTAATATGTGGTTTATTAGTGGTGATTCTGATGGTTTTGGGCCTGGCGTCCGCTGACTGGTTGATGGCGGCCGGCTGGAGGCAGGGTCTCTTCATGCACTGCATAGACCCCGCTGCTCCAACACCGCTGCCCTTCGATATAACAGCGCAACCAGGATGCTACGCCGCCAGACCGGCGCCATATATTAAGGCTGCGGCTGGCTTGTGTGTGGCCACACTAGCAGCGGACGTCTGCGGCGCTCTTCTGACAGGTCTAGGTCTTCGATCAGCTGATCACCGTACCAAGTTTCGGTATTACCGTTTCGCTGTTCTCGCCATGTCCCTCGCACTGATGTGCATTTTGATAGCCCTGGTAATATACCCGGTCTGTTTCGCTGCTGAGTTGAATCTCGGTAACAGATCGGTATGGGAATTCGGTTGGGCTTACGGCGTGGGTTGGGGAGCCGCCATCTTTCTTTTCGGAGCAGTTGTACTTCTGCTTTGTGATAAGGAGAGCGAGGAGCTCTACTACAAGGAGCGAAAAGTGGTGAGCGGGGAAGGTGGTGGGCGCCCCTAG

Protein sequence:

>DPOGS206476-PA
MKTDKCKKPSIGLVYLLPSFAPLVFISIVMQTNGSLSAGSLWQLWQPAELARRIIGERPSPVPPTRELWPPQERLNISTLQLAYQQELYQNEELSPRRRCSSLTSRPYVSTTVPIDSDVPVRMCTVGTNTDPPPTILSRIRGLLRRDSHLYTYSPSLPLKSISPGPKVEQYEDSSSNRSAYGIRKVMAGTKYRIAPNADEIESPKIVYNTTKPADRMVTTFGAKKSSKWFKLPARSDCSRHFRALTKVCGQRSEESSRRQPNLTPLQITQCVVSDLVKCVMSGRAGAGAGGRSHPRALPPVLPPPPEFAEQDDMPANQIAQVIALICGLLVVILMVLGLASADWLMAAGWRQGLFMHCIDPAAPTPLPFDITAQPGCYAARPAPYIKAAAGLCVATLAADVCGALLTGLGLRSADHRTKFRYYRFAVLAMSLALMCILIALVIYPVCFAAELNLGNRSVWEFGWAYGVGWGAAIFLFGAVVLLLCDKESEELYYKERKVVSGEGGGRP-
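
Consistency check (illustrative sketch):

The record lists a 1527 bp transcript and a 508 aa protein. The following Python sketch shows one way to check that the two sequences agree. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon; neither assumption is stated in the record. The function names translate and expected_cds_length are hypothetical helpers introduced only for this example.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
# Assumption: the record does not say which translation table was used.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, on the assumption that the
    trailing "-" in the record's protein sequence denotes the stop codon.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[pos:pos + 3]] for pos in range(0, len(cds), 3)]
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


def expected_cds_length(protein_length):
    """Length in bp of a CDS encoding protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)


if __name__ == "__main__":
    # First 30 bases and last 12 bases of DPOGS206476-TA, copied from the record.
    print(translate("ATGAAAACTGACAAGTGCAAAAAGCCGAGT"))
    print(translate("GGGCGCCCCTAG"))
    # Compare the listed protein length (508 aa) with the transcript length (1527 bp).
    print(expected_cds_length(508) == 1527)
```

Passing the full nucleotide sequence to translate and comparing the result with the DPOGS206476-PA sequence extends the same check to the whole record.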