Monarch geneset OGS2.0

DPOGS212969
Transcript: DPOGS212969-TA (2067 bp)
Protein: DPOGS212969-PA (688 aa)
Genomic position: DPSCF300057 + 445486-494636
RNAseq coverage: 612x (Rank: top 21%)
Annotation
Heliconius: HMEL005977; 3e-145; 80.40%
Bombyx: BGIBMGA011612-TA; 3e-95; 76.13%
Drosophila: CG33144-PB; 1e-79; 46.98%
EBI UniRef50: UniRef50_Q16LE5; 6e-86; 50.85%; Ubiquitin conjugating enzyme 7 interacting protein n=2 Tax=Aedes aegypti RepID=Q16LE5_AEDAE
NCBI RefSeq: XP_967887.1; 7e-94; 46.81%; PREDICTED: similar to ubiquitin conjugating enzyme 7 interacting protein [Tribolium castaneum]
NCBI nr blastp: gi|91079594; 1e-92; 46.81%; PREDICTED: similar to ubiquitin conjugating enzyme 7 interacting protein [Tribolium castaneum]
NCBI nr blastx: gi|383862987; 3e-106; 54.15%; PREDICTED: probable E3 ubiquitin-protein ligase RNF144A-like [Megachile rotundata]
Group
Gene Ontology: GO:0008270; 5.5e-14; zinc ion binding
KEGG pathway:
InterPro domain: [472-532] IPR002867; 5.5e-14; Zinc finger, C6HC-type
Orthology group: MCL11694; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212969-TA
ATGTTGCGCCCGGAGTCACCCCTCGCTGATCACCGCAGCCTTCTGCCCCGCCGTCTTCTACCGGCCAAGGCACAAATCGCACCAGTTGATAAGAAGAAGAACAGACGATCTCTGGAACTAAACCCTCCGTCCAACGACCACGCCCCATACGACTATCCCAAACTCCTCATATCGAAGACCACTGATTCACTTAACACGGAATGCCCTCGGAAACTGATCAACAAACAAGACTCCTCTGATAGTCTCAAGCGAGCGTTGTTATCCAAAGAAGATAAGAAATGTGCATTGAAAGAAGTAGACAAGAAGGTCTTAACTAAACAGGACTCAAACGAGAGCCTCAAGAAGAACCCGCTCAGCCGACAAGATTCGAATGATAGCGCGAAAAGAAGAGACTCGGATCCGAAATGGCTGAGATCAGATGAAAAGAAATCTGTGGAGATTAAGAGAGGCGGTGATAAGAAGGGTCTACTGGAGACCGATATAGATGATGCGTGTAAACGTATCTACGAGAGGAGAACGGGCTGCGCTCGGCCTACTCTGCCTCCTGTGGCGGAGAAGGGCGTGTCCCCCAAATCACCGAACGGAAGCCCCATAACGCCGGGGGTGGTCGCGGTCGCTGAGGGGGTAGTGCGATGGGGCAGTGGACTACTGTCTCCCAAAGACGAGCCAAGACTGAGACCAGCCAGGAGCTGGTATGGATATGGAACACTGCCCACTGACGGTGACAAGATGGGAGCAGGCGCGAACGGACCTGGGGGTCGTCGTGCCCTGGAGCTCATATTAAGTGGAGGCCTCAAGTCGCTCGCCAGGCCCCCCAAACCGGTCCGAAGGGCGGCCTCCAGAGACTCTGTCCTGTCACAACCGCAGCCGTTGCTGGAAGGTCTTCGTAAATGTTCCACCGTGCTCGCTCTAACTGATCGTGAGAAGGCCCCAGAACCGATCAGAGCTCATAACAAACTGAGAGCACCCTCCGTCTGTTCCAGGTGTTCGTCACTGCTCTCACTCGCCGGGGCTGGAGGCTCCAGATACTCGTTAGACCACGCCGGTGGCTTCGTCCCAGCCGCCCCGATCAATTGCAAGCTTTGCTTAGACGACGCGACCTCCGATAACGTGACCGTCATCTCGGGATGTGGGTGCAGCTTCTGTACCAGGTTCTCGTCGTTGTGTATAGAGGTTGAATCGGGTCAGCCAGCCGACCTGACTGGCGGCAGGCTAAATGGTAAATGGGTCGCCAGTCACCTGCTTTACTTGTTCTATGACTGCAAAGTGAATACTAAATGGTGCATGAAAGCTTACGTGGAATTCGAAGTATGCAACGGTGCGTATGAAGTGTCCTGTCCCGATGACCGCTGCAGCGCTGGCGCAGCGCTGTCCTTGGACGAGATAGGACTACTCGTTGAACCATCCGTGATGGAGAAGCATCTCAAGTTCAGGCTTAATCATGAAGTGGCAATGGACGCGATGCGTGCTTTCTGCCCTCGACCCGGCTGTGACACAGTGGTACAGGTCCGAGCAGCCAGTCCAGCACATTGTCCAACATGCAGACACGACTTCTGTTCACAGTGTAACCAAGAGTGGCACGGCGGTATATCCTGTGAGGCAGCTGCGGCGTCATCCTCTATGGGTGGTGCCGGAGCTCCTCTTCTGCCAGATTCCGAGCTGATCAAACTCTGTCCCATGTGTCGAGTCCCCATAGAGAAGGACGAAGGCTGCGCTCAGATGATGTGCAAGAGATGTAAACACGTCTTCTGTTGGTACTGCCTCGCCTCGCTTGATGACGACTTCCTTCTGAGACACTACGACAAGGGACCTTGCAAGAACAAATTGGGCCATTCTAGGGCATCAGTGTTATGGCATCGTGCTCAGGTGGTAGGTATCTTCGCTGGTTTCGGTCTACTCCTGCTCGTGGCCAGTCCACTGTTGCTTTTGGCCGCGCCCTGCATCGTGTGCTGCAAGTGCAGACTCTGCAATCCCAATACGAAGAACTTGGAGGAAGTGGA
TGAGATAGAGAGCATCAGTCCCGGCCGAGAAGACGACGACGAACGTACGAGATACATCTCAGACTGA

Protein sequence:

>DPOGS212969-PA
MLRPESPLADHRSLLPRRLLPAKAQIAPVDKKKNRRSLELNPPSNDHAPYDYPKLLISKTTDSLNTECPRKLINKQDSSDSLKRALLSKEDKKCALKEVDKKVLTKQDSNESLKKNPLSRQDSNDSAKRRDSDPKWLRSDEKKSVEIKRGGDKKGLLETDIDDACKRIYERRTGCARPTLPPVAEKGVSPKSPNGSPITPGVVAVAEGVVRWGSGLLSPKDEPRLRPARSWYGYGTLPTDGDKMGAGANGPGGRRALELILSGGLKSLARPPKPVRRAASRDSVLSQPQPLLEGLRKCSTVLALTDREKAPEPIRAHNKLRAPSVCSRCSSLLSLAGAGGSRYSLDHAGGFVPAAPINCKLCLDDATSDNVTVISGCGCSFCTRFSSLCIEVESGQPADLTGGRLNGKWVASHLLYLFYDCKVNTKWCMKAYVEFEVCNGAYEVSCPDDRCSAGAALSLDEIGLLVEPSVMEKHLKFRLNHEVAMDAMRAFCPRPGCDTVVQVRAASPAHCPTCRHDFCSQCNQEWHGGISCEAAAASSSMGGAGAPLLPDSELIKLCPMCRVPIEKDEGCAQMMCKRCKHVFCWYCLASLDDDFLLRHYDKGPCKNKLGHSRASVLWHRAQVVGIFAGFGLLLLVASPLLLLAAPCIVCCKCRLCNPNTKNLEEVDEIESISPGREDDDERTRYISD-
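
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the standard nuclear genetic code, writes the stop codon as "-" to match the protein record, and uses a hypothetical helper named translate. Only the first 15 and last 75 nucleotides of DPOGS212969-TA are copied in, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Stop codons are written as "-" to match the protein record above.
BASES = "TCAG"
AMINO = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 15 nt and last 75 nt of DPOGS212969-TA, copied from the record.
HEAD = "ATGTTGCGCCCGGAG"
TAIL = ("GAAGTGGATGAGATAGAGAGCATCAGTCCCGGCCGAGAAGACGACGACGAA"
        "CGTACGAGATACATCTCAGACTGA")

# Length check: 688 residues plus one stop codon should span 2067 bp.
TRANSCRIPT_BP = 2067
PROTEIN_AA = 688
lengths_agree = (PROTEIN_AA + 1) * 3 == TRANSCRIPT_BP

if __name__ == "__main__":
    print(translate(HEAD))   # MLRPE
    print(translate(TAIL))   # EVDEIESISPGREDDDERTRYISD-
    print(lengths_agree)     # True
```

Both translated fragments match the corresponding ends of DPOGS212969-PA, and the lengths agree: (688 + 1) x 3 = 2067.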