Monarch geneset OGS2.0

DPOGS209209
TranscriptDPOGS209209-TA2196 bp
ProteinDPOGS209209-PA731 aa
Genomic positionDPSCF300061 + 886786-897619
RNAseq coverage487x (Rank: top 26%)
Annotation
HeliconiusHMEL0147731e-13155.54% 
BombyxBGIBMGA001330-TA2e-15047.05% 
Drosophila% 
EBI UniRef50UniRef50_UPI0002247BA47e-2928.50%UPI0002247BA4 related cluster n=1 Tax=unknown RepID=UPI0002247BA4
NCBI RefSeqXP_001600758.16e-3028.23%PREDICTED: similar to FNBP4 protein [Nasonia vitripennis]
NCBI nr blastpgi|3454968943e-2828.50%PREDICTED: hypothetical protein LOC100116220 [Nasonia vitripennis]
NCBI nr blastxgi|3454968942e-4027.60%PREDICTED: hypothetical protein LOC100116220 [Nasonia vitripennis]
Group
Gene OntologyGO:00055153.9e-06protein binding
KEGG pathway 
InterPro domain[56-84] IPR0012023.9e-06WW/Rsp5/WWP
Orthology groupMCL22657 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209209-TA
ATGAGTAAAGCAAATCCGTTAGGCCTCATCGCTGCTTACGGTGATTCTGACGAGGAGTCTGATGATGGGTCGGCTGTAAAGTGTGATAACCGAGCTTACTTCACCGCATACGAAAACAAACCCTCAACAGTACAGGCTGGCATCCATCCTTCGCCGATAGCACATTGTCCATGGTCAGCCTGTTATGATGAAAACAGCGGCTTCACATACTATTGGAACCAGCAAACCAATGCGGTGACCTGGGAGGCACCAACCGAGTACCTTTTGGCTCTAAAGCTCGCCCAACAGCAGTTAACCACAGCAGGTTCTACAGAAGTGTCAGCTGAGGAATGGCAACTCTACCAACAGGCATTAGCGGAGAAACAAACACAGAAGACTACACCCAGCACTAAACCTGCCGCAAAGAAGACGGAGAAAAGTGTTAAAAGCAATAAGAAACGGAGTAAAAGCGATGATGAGGAAAAAATAGAACTCATAACATCGTACCACAACTCGGATTCCGAGTCCAACGACGAAACAGAGAAGTCGCCGACACTGCCAACACCCAAACCAAGTAAAATAGCAAAAGTTGTACAAAAAAAACCCAGAACGCTGGACCACAAGCCGCCGCCATCGGCAAACCAGAATTATATGGTGCCCATAGGACCGGAACTGCCCCCCGGACTGGAAAACAAGTCAACAAAACATGTGTTAAACGAGGAGGTGAAGTTGGCACAGAAATCGGAAAACGACTGTCCCGAGGAGAAAGTGCTGCTTAGGAAGTTGAAAGATAAAGCGAAGCTTCTAGAAAAACTAGGAGGCGAGTTGCCTTCGGACGTGCAGGAGATTATCAAGGATGACATGAAAGCGGTGGACTCGCCGAAATCTGACAAGGACGTTACCGATATTGATGTTCTGTTGGAAGAAATAGAAAAGAAAGAGTTGCCCAAAGTTAAACCGAAGACAAATGAAGGCGGATCCAAGTCTGGCAGTAACTCGCCTAGAGATAAAACAACACCTTCAGACGATGCGGAACCAAAAAATACTCCCAACGGCCCGAGCTTGTTTCCCAGCGCCGCTTTTATAGATAAAAATAGCATAGATAAAGACACACCACCCATAGACAATCAGAAGAAGGAGAACTTGTATTTAATGAACACGGAGACCGTAGACAATGTTAACAGAAAAAGATTGCGTATTTCAAATTCAGTGTTGCCCGATCGGAAACGGGATAAAGTCGAAATACCGACTTACACGACGAAGTATGCGCAGTTCGTTGAGGGCGTTTCGAGCGAAAGGACGGGTCTGGGATTCAGTAAGGACGAAGAATATACGGAGAATCCAAAAAACACTATCAGCTACGGCAACGGATTGACATTCACTAAAGGAGAGACTTTGAACGAGGAGAAGAAAGACGAGGATTTGGATGATCTGACGGACCTCGTGGAGGCCAAGCTGAAGTTCCTCAGCCAGATACAACCATACACCCTCACCACCATCCAGGAGATGATGATACAGATGCAGACTTTGTTATCGGCGTTCCGCGCGGGTGCGTTATCCTCGTCATACTGGCGGCGGTGGGCGGGCGGCGCGCGGTCGACGCTGGCTGCTCACGAGGCGGCCGCCGCGCCGCCCGGCTGGAAGTGCGTCTTCCTGAGGTACACACCCGCTTATATACCACCCCCACCCCCACCACATCCCCACCACGCGAATGCGCTCTTGTCTGAAGGCCGGTACTGCTACACGAGGGAGGTTGACGGCTTCCAGCAGTACGAATATCCAGCGGTCGATACCACCGACATGGATATATCTACCACTCCACCACACGAACCCAAGGAGGAGAGCTGGCGCGCGACTCCCCCTCCGCCCGGAACTGATGATGCGGAACAAACGATCAATGATGCGACATCAAAGAAGGAGATCGGGGACGAGCTGCAGTCCTTCTACAACGACCTCGCCGAAATAGAGAAGAGTTCAGGAACGGAACCCAACTCACCGGAACCGCCGCAGCCTCCGCCGCCGCCCGAGATCAGCGACACCGTGAGGGAGATGAGAGAGATGAGGGAGACGAGGGAGATGAGCAGGGAGAAGGACGTCCGCCTTAATAGGAAGAAGTCAAAGGTCAAACTGTCCACGTGTATCGGCATGAAGCACAAGTCCGTGTCCAACTTGGTCGCCAAATGGCAACAAGTGGCCGAAGAAATAAACTCGGACTGA

Protein sequence:

>DPOGS209209-PA
MSKANPLGLIAAYGDSDEESDDGSAVKCDNRAYFTAYENKPSTVQAGIHPSPIAHCPWSACYDENSGFTYYWNQQTNAVTWEAPTEYLLALKLAQQQLTTAGSTEVSAEEWQLYQQALAEKQTQKTTPSTKPAAKKTEKSVKSNKKRSKSDDEEKIELITSYHNSDSESNDETEKSPTLPTPKPSKIAKVVQKKPRTLDHKPPPSANQNYMVPIGPELPPGLENKSTKHVLNEEVKLAQKSENDCPEEKVLLRKLKDKAKLLEKLGGELPSDVQEIIKDDMKAVDSPKSDKDVTDIDVLLEEIEKKELPKVKPKTNEGGSKSGSNSPRDKTTPSDDAEPKNTPNGPSLFPSAAFIDKNSIDKDTPPIDNQKKENLYLMNTETVDNVNRKRLRISNSVLPDRKRDKVEIPTYTTKYAQFVEGVSSERTGLGFSKDEEYTENPKNTISYGNGLTFTKGETLNEEKKDEDLDDLTDLVEAKLKFLSQIQPYTLTTIQEMMIQMQTLLSAFRAGALSSSYWRRWAGGARSTLAAHEAAAAPPGWKCVFLRYTPAYIPPPPPPHPHHANALLSEGRYCYTREVDGFQQYEYPAVDTTDMDISTTPPHEPKEESWRATPPPPGTDDAEQTINDATSKKEIGDELQSFYNDLAEIEKSSGTEPNSPEPPQPPPPPEISDTVREMREMRETREMSREKDVRLNRKKSKVKLSTCIGMKHKSVSNLVAKWQQVAEEINSD-