Monarch geneset OGS2.0

DPOGS213526
Transcript: DPOGS213526-TA | 1494 bp
Protein: DPOGS213526-PA | 497 aa
Genomic position: DPSCF300033 - 736059-738046
RNAseq coverage: 188x (Rank: top 48%)
Annotation
Heliconius: HMEL013674 | 0.0 | 75.88%
Bombyx: BGIBMGA011808-TA | 0.0 | 70.99%
Drosophila: ldbr-PA | 1e-123 | 50.35%
EBI UniRef50: UniRef50_E9HAB2 | 4e-137 | 49.70% | Putative uncharacterized protein n=2 Tax=Coelomata RepID=E9HAB2_DAPPU
NCBI RefSeq: XP_001846311.1 | 7e-140 | 59.43% | lariat debranching enzyme [Culex quinquefasciatus]
NCBI nr blastp: gi|170036925 | 1e-138 | 59.43% | lariat debranching enzyme [Culex quinquefasciatus]
NCBI nr blastx: gi|170036925 | 3e-138 | 52.04% | lariat debranching enzyme [Culex quinquefasciatus]
Group
Gene Ontology: GO:0006397 | 3.1e-34 | mRNA processing
GO:0016788 | 3.1e-34 | hydrolase activity, acting on ester bonds
GO:0016787 | 9.1e-13 | hydrolase activity
KEGG pathway 
InterPro domain: [237-371] | IPR007708 | 3.1e-34 | Lariat debranching enzyme, C-terminal
[1-229] | IPR004843 | 9.1e-13 | Metallophosphoesterase domain
Orthology group: MCL14067 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213526-TA
ATGAAAATTGCTATAGAGGGATGTGCACATGGTGAATTGGATAAGATATACGAGTGTGTAGAAACTTTACAGAGAAGAGAAGGAATAAACGTGGATTTGTTAATATGTTGTGGAGATTTTCAATCAGTCCGCAATAATGATGACCTTAGGGCTATGGCTGTACCAGAGAAATATCAAAACATATGTACATTCTACAAATATTACAGTGGTGAAAAAATCGCTCCAGTATTAACTTTATTTATTGGAGGCAATCATGAAGCTTCAAACTATTTACAAGAGCTGCCATATGGTGGCTGGGTTGCACCAAACATATACTTCTTGGGCAGAGCCGGTGTTGTACAGTTTGGCAATTTACGAATTGGAGGACTATCAGGAATATTTAAAGGCCATGATTATTTACAAGGTCTCTGGGAATGTCCTCCTTACACCCCTGGTTCACTGAGATCAGTTTATCATATAAGATCTCTGGATGTGTTTCGGTTAAGTCAAATGAAAGAAAACATCCACATCATGTTATCACATGATTGGCCGAGGGGTATCACTAGTTATGGGGATAAAGAGAATTTACTAAGAAGGAAACCGTTCTTACGAGATGATATTGAGTCAAACCAACTAGGTAGTCCCCCAGCGGAGAAGTTGTTACACACATTGAAGCCTCAGTACTGGTTTGCTGCACATTTGCATTGCCAATTTGCTGCCGTTATTAATCATGACAATAATCGGGAAACAAAATTTCTTGCTCTAGATAAATGTTTGCCACGAAGAAGGCATTTGCAAATATTAGATTTAGCAACAGAGTATGACGGTGACAAGACTTTAAAGTATGATCCTGAATGGTTGGCAATTTTGAGAAATACCAATCATCTTTTATCCGTCAAGAACGTAGATTGTCATCTACCTGGCCCCGGAGGTGATGAACGGTATGATTTCACACCAAGTGAAGAAGAGAAAAATGCAATATTAAGTCTATTAGATACATTAATAATAACCAATGATTCATTCGTCAAAACTGCACCGGTTTATAGGCCTGGTGCACCAAAATGTCAACCCACGGAACCTGTGCTAAACCCCCAAACCGCTTATTTATGTGAAAAGTTAGGTATTGATGATCCCATCCAGGTGATAATCGCTCGTTCAGGCAGAACTATAAGGCATGTACAAATTGAAAATAATCAGAATGAAGAGAAAGATGACATTATTGAACAGACACCATTCAAATGTTCAAAGCTTTCTCTCCCGGCTCCAATAACACCCAGTGGGAATGACGAGGACGCTTCAAGAGAAACATTAGCTTGTACACCAGAAAATAGTTTTTTATCTATCAGTAATACATCAGATTGTATAACACCTCCGAGTGCTACAAAAAAGGTTTTCAAGAGACGTAATCTAGCTATATACACTCCTGAGGAAGAGCCGGAGAGTGATTCAAGTAGTTCGTTCATGAGTACACAGAGTCCAAGATCGAGTAAAATATTCTGTAAAAATGACTTATAA

Protein sequence:

>DPOGS213526-PA
MKIAIEGCAHGELDKIYECVETLQRREGINVDLLICCGDFQSVRNNDDLRAMAVPEKYQNICTFYKYYSGEKIAPVLTLFIGGNHEASNYLQELPYGGWVAPNIYFLGRAGVVQFGNLRIGGLSGIFKGHDYLQGLWECPPYTPGSLRSVYHIRSLDVFRLSQMKENIHIMLSHDWPRGITSYGDKENLLRRKPFLRDDIESNQLGSPPAEKLLHTLKPQYWFAAHLHCQFAAVINHDNNRETKFLALDKCLPRRRHLQILDLATEYDGDKTLKYDPEWLAILRNTNHLLSVKNVDCHLPGPGGDERYDFTPSEEEKNAILSLLDTLIITNDSFVKTAPVYRPGAPKCQPTEPVLNPQTAYLCEKLGIDDPIQVIIARSGRTIRHVQIENNQNEEKDDIIEQTPFKCSKLSLPAPITPSGNDEDASRETLACTPENSFLSISNTSDCITPPSATKKVFKRRNLAIYTPEEEPESDSSSSFMSTQSPRSSKIFCKNDL-
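
The two sequences above can be checked against each other: a 1494 bp transcript holds 498 codons, which is 497 amino acids plus one stop codon, shown as "-" at the end of the protein. The following is a minimal sketch of that check, assuming the standard genetic code and that the transcript is a complete coding sequence in frame from its first base. The function name `translate` and the `stop_symbol` parameter are illustrative and not part of the database; only the first 30 and last 12 bases of DPOGS213526-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear code applies to this transcript).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol to match the "-" that ends
    the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 12 bases of DPOGS213526-TA, copied from the record.
first_bases = "ATGAAAATTGCTATAGAGGGATGTGCACAT"
last_bases = "AATGACTTATAA"

print(translate(first_bases))  # start of the protein
print(translate(last_bases))   # end of the protein, including the stop
print(1494 // 3 - 1)           # codons minus the stop codon
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the full DPOGS213526-PA record, including the trailing "-".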