Monarch geneset OGS2.0

DPOGS210050
TranscriptDPOGS210050-TA2253 bp
ProteinDPOGS210050-PA750 aa
Genomic positionDPSCF300017 - 1104281-1107933
RNAseq coverage879x (Rank: top 14%)
Annotation
HeliconiusHMEL0104230.078.70% 
BombyxBGIBMGA012689-TA0.070.47% 
DrosophilaCG32676-PA2e-4975.00% 
EBI UniRef50UniRef50_E3WPE57e-5868.90%Putative uncharacterized protein n=2 Tax=Coelomata RepID=E3WPE5_ANODA
NCBI RefSeqXP_001862643.11e-5551.11%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3123833092e-5768.90%hypothetical protein AND_03596 [Anopheles darlingi]
NCBI nr blastxgi|1700533746e-9636.57%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00055157.2e-07protein binding
KEGG pathway 
InterPro domain[20-90] IPR0006267.2e-07Ubiquitin
Orthology groupMCL20383 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210050-TA
ATGGAACGTGCAACTGGAACAGAAGTGTACGGGTGTGGCTCGCCCCAATCGACGGACATAACGCTAAATATACAAACGACTACGGGTGGTAATTTCTCCATTAGTTTGAACGGTAAAAATACTGTGGAGCATCTTAAAAAAGTCGTTTCCAAGAAATTAAAAGTGTCAAAGGATCGAATATGCCTTCTGCATCGCGAAAGACAACTAAGGGATGGTACTCTCGAAGACAACGGTTTGTTAGATGGGTCGCGCATCATTTTACTGCCCAGCGTCGAGACGGGCCTTCTGAGTCAGAGACCTGAAAATTCTGTGATGCAAGCCTTAGAGTCCTTGAATGACACGCAAGTGAATGATTTTCTGAGTGGTAAATCACCTCTGAATTTGACAATGAGACTGGGTGACCACATGATGTTAATACAGCTACAGTTGTCAACATTACACACATCTTCCTCAAGTGGTAGTCGCACACTGAGAACTTCCACTCCAACTAAAACTACCACAGTCCGTACAAAATGTGACTCTAAGGACACACCCAATCCTCAGACTATACCACAAAAGCCTCCACCAAGTGACGAGACTATTACTAATAGTGATAAGAGTAAACAAAAGAAAGAAGATGACAATGTAGCCAATAATCTGAAACTAGATATGTTGCAAAATGCATTTGATCTGTACCGTACAATGGCAGAGGAGAAATATTCAGAGAAATCAGCTCAAGGTTCAGATAAAGAGGAAATGATGGATACCTCAAACTGCACTTTATCAGATTTATCCAACACATCGGTTGAAAATGAACAGTCCCCAATTAAATCACTATCTAATCTGGTCTCTAGTCCAATAAACACATCCGCTGAAAGCAAAGAGGCGGCTATAGCAAGTTCTGTAAAAAATAGTTTAATAGACTTGTTGTCAAGCAAACTGGCAGCAGAAGCAATCCCTTCCACAAGTTCCATGTGTGACAAAGCATGTCCATCAACATCCAATACACCTGACAGTTCAGTCAGTGATGACAGCGACAATTTCACAGACCAGAGTTCCTTCCTCGCTGAAAGCACTATAGATGAATATCCCATGGAAAACTTATTTAACAGTGCCGTAGACAAGGGCTTGTTCGATGAGCAAGACGAATCCATGATGGACAGGGAGATATCGAACTTGGCGACCACAAGTCATGGGACTCAAGAATCATCTGGAGCCTTGCCATCCTTCCAAAGCCTAAATGAACCTCTAAAAAATAAACGTTTTCAATATTTACTTCACAAAACATCCAAATTCAAGCATCCTATTGGCCACAATAAACAAAAGGCCTTCATGCAGTCTGTTGTAAATAAACATAAAAAGAAGGTGCAGCCCCTGTGTGCCCCAGACTTAAGTTTCCCTCAACCGTCTACCTCAAGAACAGAGCCGTATGTAAGCTGTAAAAAGCCTGTTCAGGTATCTGGGCCCACAGCAGAGACTGCCACTCCATCCACATCCAAACAAGCCTTCAGGGCTCCCGATGCACCTAAAAAACCACAGAGGGCTTCCCCAACACCACCGGTAGACACCAAAGCGCTCATTGAAGCCTCAAAGAATCTGACACAGAAGTTGAAGAAGTTGTCGAAGGAGGTGTTAACAAACAAAGTAGATCTCAAAGCTGTAGAGGAGCCGGTGCGTGCGAAAGTACGTCCCGGAGCTGTGATAGAGTCCATGAAGCATCATGGCAAAGGAATATATTCAGGGACGTTCTCTGGAACACTGAATCCAGCTCTTCAAGACAGATACGGCCGACCAAAGAGAGATATATCCACGATAATCCACATCTTGAATGATCTGTTGTGTGCGACGCCACCCATAGCCTTAGCGCAGAAGGAGCCGAAGCATTCTTGCGTGACGGTCACTCAGGAGGAGCCGAGCAAGTACAGCGGTTGTCACACGTGTTCGTACGGACATTGCGACGGACATCCGACCAGCAAGAACTGTGTGTGTGATAAAGACTGTCAGTGTAGTCGTCTAGAACCTAACGTGTGTGCGTCCTGTGACGAGAAAACCAGCCCGGGGGAGCTCTGCAAGAAGTGCGACAAAGCGAAGACGTTCGCTCTGGAGAACTCGAAGACTAAATGCAAACTGGAACAGCTGAGGCTCGTCATGCAACAGACGAAGCAACGACGCGAGGCGCGCAAGTTAAAAACATTACCTTACACCACGCCTTCCAAGACACACGCCTCAAATGATACCACATCCATCAAAGAAGAGATCGAAACGGCCGCATAA

Protein sequence:

>DPOGS210050-PA
MERATGTEVYGCGSPQSTDITLNIQTTTGGNFSISLNGKNTVEHLKKVVSKKLKVSKDRICLLHRERQLRDGTLEDNGLLDGSRIILLPSVETGLLSQRPENSVMQALESLNDTQVNDFLSGKSPLNLTMRLGDHMMLIQLQLSTLHTSSSSGSRTLRTSTPTKTTTVRTKCDSKDTPNPQTIPQKPPPSDETITNSDKSKQKKEDDNVANNLKLDMLQNAFDLYRTMAEEKYSEKSAQGSDKEEMMDTSNCTLSDLSNTSVENEQSPIKSLSNLVSSPINTSAESKEAAIASSVKNSLIDLLSSKLAAEAIPSTSSMCDKACPSTSNTPDSSVSDDSDNFTDQSSFLAESTIDEYPMENLFNSAVDKGLFDEQDESMMDREISNLATTSHGTQESSGALPSFQSLNEPLKNKRFQYLLHKTSKFKHPIGHNKQKAFMQSVVNKHKKKVQPLCAPDLSFPQPSTSRTEPYVSCKKPVQVSGPTAETATPSTSKQAFRAPDAPKKPQRASPTPPVDTKALIEASKNLTQKLKKLSKEVLTNKVDLKAVEEPVRAKVRPGAVIESMKHHGKGIYSGTFSGTLNPALQDRYGRPKRDISTIIHILNDLLCATPPIALAQKEPKHSCVTVTQEEPSKYSGCHTCSYGHCDGHPTSKNCVCDKDCQCSRLEPNVCASCDEKTSPGELCKKCDKAKTFALENSKTKCKLEQLRLVMQQTKQRREARKLKTLPYTTPSKTHASNDTTSIKEEIETAA-