Monarch geneset OGS2.0

DPOGS202836
TranscriptDPOGS202836-TA2400 bp
ProteinDPOGS202836-PA799 aa
Genomic positionDPSCF300018 + 896818-905624
RNAseq coverage367x (Rank: top 32%)
Annotation
HeliconiusHMEL0062911e-12053.67% 
BombyxBGIBMGA010509-TA0.052.17% 
DrosophilaCG7206-PB3e-1433.80% 
EBI UniRef50UniRef50_Q4RJ231e-3338.37%Chromosome 1 SCAF15039, whole genome shotgun sequence n=4 Tax=Tetraodontidae RepID=Q4RJ23_TETNG
NCBI RefSeqXP_002422891.11e-2429.45%ubiquitin-protein ligase BRE1, putative [Pediculus humanus corporis]
NCBI nr blastpgi|472221835e-3338.37%unnamed protein product [Tetraodon nigroviridis]
NCBI nr blastxgi|3071775994e-4224.58%Uncharacterized protein C1orf26 [Camponotus floridanus]
Group
Gene OntologyGO:00055152.8e-05protein binding
KEGG pathway 
Orthology groupMCL22240 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202836-TA
ATGAGTTCCAAGAAATCTGATAATAATAAGCTACCAGATGGTTGGGTACTATGTCAATCAAAATCAATTCCGGGAAGGAAGTATTTTTTTAACAAAAAAACTGGGAAATCTTCGTGGTCCCAGCCCCAGGCTGATGACAAATCTGGTAGGATGACGTGTAAAAAAGAAAAATTAAGGAGACAAGAAAGAAAGCGGAGAGAGGAACTCGCCAAAAAGGCAGCCAAAAGGAAAAGTGATGTTAATGATGGAAGTGATTCAAGGAAAAGAACCAAAGGTGCTTCTGATGCAGGGTGCAGCTCTAGAGGTAGGCTGAAAGAAAGTCCTCATCATCATAGAAGGAGAAAAGAGTCATCACAACCATCACACAGCACTCCAAAAACCACTCCATCAAAATGTACACTCAGTAACTCTGTGTCTCCACAAAAACATTCCTCTTCAAAAAATACAGCAAATACAAGGCTTTCAAATCTAAGAGCAAAACTGGCAGCAGAAGTAAGAGAAGAAGACCTCAATCTTAAAAAGAAGGTCAGTCCTAAAACAAATAACGAAACACCACAGAAGAGTCAAAGAAGTGAGCAGAAAAGTCCCAGCCTAAATGATTCCAAATCAACTCAATCACCGAGCAGTGATAGTTCTCAATCAATGCCCTCACCCTCGCAGTTCTTTGCTGCTAATAAGATCATATCTTCAATGAAGGCTCAGTTGCCTGAGGAGTATTGTAATAAAGAGAAACAGAAGGATACTTTTGCGGATATTGAAAAGGGTATATGTAGCCAAACAACCGAGTACCCGTTTTCAAAGCACCCCGCTACACCCCCACAGTTCTCGGAAGCCAGCAAATTAGTATCGGCTATAAAATCCAAATTGGCATACAAAGGCTCCTCTTCGAAGGAAGGACTCAAAACATTTCCGTCAGCCAATGCCAGGCTGGAAGTTTTGAGGGCGAATCTGAGTATGGAGGCAGAACAAGAAATGGGGGATAGTTTTTTAAGTAAAAGTAATGATTCAGAACAAAGGGACATATCAGAGGCAATGGAGGTTGAAGAAATAAAAGAGGCCAAACATAATGAGACGTTTCTGAGGGACAGCACGTCGGGAGACGACCTGGTTTTGGTCATTGACACTAATATATTTATTCATGACCTTGACTTCATCAAGGTCGTCCTCAACTCACATATCAAAGGCTACAGCGAACAACCAACAGTCCTCGTTCCCTGGCGAGTGATCAACGAGCTCGACCGTCTTAAAGACAACAACAACGGTAACGGTTCCCTCTGCAAGAGGGCCAAGGCTGCTATGGACTATCTGTATAAATCACTACCAGAGAACAGCAGAATTAAAGGTCAATCATTGAGGGATGCCAATTCTCACATATACCCGTGCGAGGTTCCAGACGATGAAATATTGAACTGCAGCCTGCAGCAGCTGGAGAGAGACAAGAATGTTATACTGCTGACCAACGATAAGAACCTGTGTAACAAGGCCAGCATCAATAACGTGAAACACAGCAACGTCAGCGAACTGCAGAAACTGGTGGAGAACAAGCCACAGCCGCAGACCAGCGACCTGCGGGCCACCGTCAAGAGATACACCGAAGGAGTCTACCACCTCCTTGCCAACATACTGGAGAATGAGATGCGAGCCAAGTACAATGAGCTCTGGCAGCACGTGGTGTTCAAGCCGCCGCCCTGGTCGCTGGACGACGTCCTGCAGTGCCTGCTGAAGCATTGGATCGCCGTCTTCAACGAGGTGTTCCCCAGGATCGAGCATCTGTTGGCCGACCTCCGAACCAGCCTCATAACGATCGAGAAAAAAGAGCCGAGCACCCTGACGCAGTCCGAGGTGTCGACGTTTAAGGAGTTGTGTGTGGACGTGACCCGCCGCTGTCAGATCATCCCGGAGTACATGGAGCTGGCTAAGACCACCCTCGCGCAGCTGACGCGAGACGGAATCGCCCCGGACACCGTGGACGCCTTCGAGGCGCTCTGGACCGTGCTCTCCAGCTACTGCGCCAAGCTGGCTTCAGCGTTGGGCGTGTCTCACTGTATCGAGGACTCGGTGGGCGGCGAGGAGGGTCTGCAGCAGCTGGTGTCCAGGGTCGCCTCCGTCAGCTCGCACGTCAACAACCTGGCCGCCGCACTAGCTGGGGCCCTGGAGGGGGGAGCGGGCGGGGAGGGCGCGGAGGGTGTGAGTAGTCCGTCGTCTCGTCTGCAGCACGCCGTGCTATCCGCGCTGGCTGACTGCGGCCTGCGGGCGGCGCTGCGGCGGGACCAGCTGGTCGCCTTCTGCCAAGACTGCAGGAACATGTTGCAGGAGGCTCACGACAAGTTCTCGCAGCTGTCCGAGCTGCTGAGCGTGTGTCAGGGCAGACTGGCCACCGCCGTCCGCGACATGAACTGA

Protein sequence:

>DPOGS202836-PA
MSSKKSDNNKLPDGWVLCQSKSIPGRKYFFNKKTGKSSWSQPQADDKSGRMTCKKEKLRRQERKRREELAKKAAKRKSDVNDGSDSRKRTKGASDAGCSSRGRLKESPHHHRRRKESSQPSHSTPKTTPSKCTLSNSVSPQKHSSSKNTANTRLSNLRAKLAAEVREEDLNLKKKVSPKTNNETPQKSQRSEQKSPSLNDSKSTQSPSSDSSQSMPSPSQFFAANKIISSMKAQLPEEYCNKEKQKDTFADIEKGICSQTTEYPFSKHPATPPQFSEASKLVSAIKSKLAYKGSSSKEGLKTFPSANARLEVLRANLSMEAEQEMGDSFLSKSNDSEQRDISEAMEVEEIKEAKHNETFLRDSTSGDDLVLVIDTNIFIHDLDFIKVVLNSHIKGYSEQPTVLVPWRVINELDRLKDNNNGNGSLCKRAKAAMDYLYKSLPENSRIKGQSLRDANSHIYPCEVPDDEILNCSLQQLERDKNVILLTNDKNLCNKASINNVKHSNVSELQKLVENKPQPQTSDLRATVKRYTEGVYHLLANILENEMRAKYNELWQHVVFKPPPWSLDDVLQCLLKHWIAVFNEVFPRIEHLLADLRTSLITIEKKEPSTLTQSEVSTFKELCVDVTRRCQIIPEYMELAKTTLAQLTRDGIAPDTVDAFEALWTVLSSYCAKLASALGVSHCIEDSVGGEEGLQQLVSRVASVSSHVNNLAAALAGALEGGAGGEGAEGVSSPSSRLQHAVLSALADCGLRAALRRDQLVAFCQDCRNMLQEAHDKFSQLSELLSVCQGRLATAVRDMN-