Monarch geneset OGS2.0

DPOGS202707
TranscriptDPOGS202707-TA1788 bp
ProteinDPOGS202707-PA595 aa
Genomic positionDPSCF300272 - 179111-180898
RNAseq coverage1x (Rank: top 93%)
Annotation
HeliconiusHMEL0078890.098.66% 
BombyxBGIBMGA008387-TA0.095.46% 
DrosophilaCG1972-PA0.083.05% 
EBI UniRef50UniRef50_Q5TA450.069.53%Integrator complex subunit 11 n=58 Tax=Chordata RepID=INT11_HUMAN
NCBI RefSeqXP_969343.10.082.69%PREDICTED: similar to CG1972 CG1972-PA [Tribolium castaneum]
NCBI nr blastpgi|910861470.082.69%PREDICTED: similar to CG1972 CG1972-PA [Tribolium castaneum]
NCBI nr blastxgi|910861470.082.69%PREDICTED: similar to CG1972 CG1972-PA [Tribolium castaneum]
Group
Gene OntologyGO:00167871.5e-15hydrolase activity
KEGG pathway 
InterPro domain[245-363] IPR0227121.4e-25Beta-Casp domain
[16-210] IPR0012791.5e-15Beta-lactamase-like
[377-417] IPR0111082.5e-13RNA-metabolising metallo-beta-lactamase
Orthology groupMCL13803 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202707-TA
ATGCCCGAAATTAAGATAACTCCTTTGGGAGCGGGTCAGGATGTTGGACGTAGTTGTATTTTGTTGTCGATGGGAGGTAAAAATATTATGTTGGACTGTGGTATGCACATGGGGTACAATGATGAGAGACGTTTCCCGGATTTTTCATACATAGTGCCAGAGGGTCCTATCACAAGTCAAATAGATTGTGTAATAATATCACACTTCCACCTCGACCACTGCGGTGCACTCCCATACATGTCAGAAATGGTTGGTTATACGGGACCTATTTATATGACTCACCCAACAAAAGCTATAGCTCCAATTTTGTTAGAAGACATGAGAAAAGTTGCAGTTGAAAGGAAAGGAGAATCAAATTTTTTTACTTCACAAATGATAAAAGATTGTATTAAAAAGGTAACAGCAGTGACACTACATCAATCTGTAATGGTGGATAATGAATTAGAAATCAAAGCTTATTATGCCGGCCATGTCTTAGGTGCTGCCATGTTCTGGATAAGAGTTGGATCACAATCAGTCGTATACACTGGAGACTATAACATGACACCGGACAGACATCTAGGGGCAGCCTGGATTGATAAATGTAGGCCTGATTTGCTAATATCAGAGTCAACATATGCTACAACTATAAGGGATTCAAAACGTTGCCGAGAAAGAGATTTCTTAAAGAAAGTCCATGAGTGTGTAGAGAAAGGTGGAAAGGTTCTAATACCTGTCTTTGCTCTCGGTAGAGCACAAGAGTTATGTATACTGCTGGAGACATACTGGGAGAGAATGAATTTAAAATACCCAGTGTACTTCGCATTAGGTTTAACGGAGAAGGCCAACAACTATTATAAAATGTTTATAACATGGACAAACCAGAAGATACGTAAAACTTTTGTACAAAGAAACATGTTTGATTTTAAGCATATTAAACCATTTGATAAGTCATACATCGACAACCCCGGTGCTATGGTGGTTTTTGCTACTCCGGGAATGTTACATGCTGGTTTATCCCTAAATATATTTAAGAAGTGGGCTCCCTATGAACAAAACATGTTGATCATGCCAGGTTTCTGTGTTCAAGGCACAGTTGGACACAAAATTCTAAATGGCGCGAAGAAAATTGAATTCGAGAACCGTCAAGTAGTTGAAGTTAAAATGGCTGTCGAATACATGTCGTTTTCAGCTCACGCAGACGCGAAAGGCATCATGCAGCTGATACAATACTGTGAACCAAAGAACGTGTTGCTCGTCCACGGGGAGGCACAGAAGATGGAGTTCCTAAAGGACAAAATCGAAAAAGAATTCAAGATCAGTTGTTACATGCCCGCCAACGGTGAAACGGCGATAATAAATACTCCAACGAAGATCCCGATAGACGTATCGTTACGATTACTAAAGGCAGAAGCTGTGAGATACAACGCCCAGCCTCCAGACCCGAAACGGAGAAGAGTCGTACATGGAGTGCTCTGTGTTAAGGACAATAGATTATCATTCTTAGATATAGATGAAATGTGCGACGAGATCGGCATTAATAGGCACATCATTAGATTTACTAGCACGGTGCGTTTTGATGACGCCGGTTCAGCGATCAAAACTGCTGAGAAACTCAAGACATTGCTGGCAGAGAAGCTTCAGGGTTGGTCGATTACTATATCCGATGGAAACATCTCAGTTGAGTCCGTCCTCATCAAAGTTGAGGGGGAGGACGATAACACGAAAAGCATCTACGTGTCGTGGACAAACCAGGACGAGGATCTGGGCAGTTACATTCTAGGTTTACTGCAGTCTATGGTACAGTGA

Protein sequence:

>DPOGS202707-PA
MPEIKITPLGAGQDVGRSCILLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSQIDCVIISHFHLDHCGALPYMSEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTSQMIKDCIKKVTAVTLHQSVMVDNELEIKAYYAGHVLGAAMFWIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFLKKVHECVEKGGKVLIPVFALGRAQELCILLETYWERMNLKYPVYFALGLTEKANNYYKMFITWTNQKIRKTFVQRNMFDFKHIKPFDKSYIDNPGAMVVFATPGMLHAGLSLNIFKKWAPYEQNMLIMPGFCVQGTVGHKILNGAKKIEFENRQVVEVKMAVEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEAQKMEFLKDKIEKEFKISCYMPANGETAIINTPTKIPIDVSLRLLKAEAVRYNAQPPDPKRRRVVHGVLCVKDNRLSFLDIDEMCDEIGINRHIIRFTSTVRFDDAGSAIKTAEKLKTLLAEKLQGWSITISDGNISVESVLIKVEGEDDNTKSIYVSWTNQDEDLGSYILGLLQSMVQ-