Monarch geneset OGS2.0

DPOGS214131
Transcript        DPOGS214131-TA  2217 bp
Protein           DPOGS214131-PA  738 aa
Genomic position  DPSCF300014 - 1423930-1429270
RNAseq coverage   115x (Rank: top 59%)
Annotation
Heliconius      HMEL011366        0.0     75.79%
Bombyx          BGIBMGA006179-TA  0.0     71.45%
Drosophila      CG11034-PB        1e-122  36.10%
EBI UniRef50    UniRef50_Q7PNC7   1e-132  38.81%  AGAP008176-PA n=5 Tax=Culicidae RepID=Q7PNC7_ANOGA
NCBI RefSeq     XP_001647765.1    8e-147  39.87%  dipeptidyl-peptidase [Aedes aegypti]
NCBI nr blastp  gi|157141883      2e-145  39.87%  dipeptidyl-peptidase [Aedes aegypti]
NCBI nr blastx  gi|157141883      3e-145  39.78%  dipeptidyl-peptidase [Aedes aegypti]
Group
Gene Ontology    GO:0016020  2.4e-61  membrane
                 GO:0006508  2.4e-61  proteolysis
                 GO:0008236  5.6e-42  serine-type peptidase activity
KEGG pathway
InterPro domain  [85-441]   IPR002469  2.4e-61  Peptidase S9B, dipeptidylpeptidase IV N-terminal
                 [532-732]  IPR001375  5.6e-42  Peptidase S9, prolyl oligopeptidase, catalytic domain
Orthology group  MCL12716  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214131-TA
ATGCAAATTTGGTTGGTATTGGCGTGTGTGCTGGCAGCAGCTATTGGCCAAGAACCTTTTACATTAGAGGAATTCGTGAGAGGCCGCTTCAGTCAACGTGGATTCAACGGTACCTGGATATCCGACAATGAGTTTACTTACACGATATCAGGCGAACCTGGAATCTACATGTACAATGCTGAGACACGCAACTCAACTGTACTCGTTTCTGGGGAACTAATGGGTTTTCTGAATACAAGCAATCCTATTCTTTCAGCGGATAGGCGATATATATTGGCACCAAGTGAAGTCCAGCAGGTTTACAGATATTCGACCACGGCAAAATTTGCCCTTTACGAAATTGCAACTACCAATGTGACAAACATAGCCAACCATGCCCGATTGCAGCTCTGCCTTTTTGGTAGCGGACACTCGCTAGCTTTCGTGCTTGACAATAATGTGTATTACTTGCCGGAGAACAGTACTACAGCAATACAACTCACATCAGATGGAATTCCTGGTGTAATTTATAATGGTCACACAGATTGGGTTTATGAAGAGGATGTTATGTATACCGGAGTTGCTACCTGGTTTTCCACCCGGGGAACTTACCTCGCTTTTGCCAGCTACAATGACACTTTGGTGGAGACCTACTCTTACTATCACTTCGTGGATAAAAGTGATCCCGATGACGTCTACCCGGAACTAATTGATCTGAAATATCCAAAGGTTGGTAGAACCAATCCAACAGTCAAACTTCGAGTAGTGGATCTGAGAACAGTTGCCACTAATCTGACCTACATCACCCTCGACGCTCCAGAAGAAGTTACCAGTGACCATATTCTGGGTGGAGTAACTTGGACAACAGATACTGAAATTGCAGTTCACTGGCTTAATCGACGACAAAACTATACAATTCTTAGAATCTACGAGAATCGATCCCAGTCTAACGGTTGGGTGCCCATTGCTTTACCAAGCTTCAGTGAGAATGGTGATTTCTATGTATCAACTCGATGGTCGTTGAGGCAAGCCGACGGAAATATTTGGCAGCATTTGTATTTAAGTGTCAGGGTCAATGATGAAATAATTTCAAGTTCCATCACTCCCGGAGCACACACTGTTAACAATTTCATTGGCATGGATGAAACCAATCTGGCTTACTATTACACTCGTACTGTCACTGATGCTCCATGGCAGACTCAAGTGCACGTTGCGGGCGCTAGGGTGGCTTGTCTGAGTTGTGTTCTCCAGATGCCTGATGGAGGACCTTGTACTTGGGTGACCTCAACAGCTAGTCTTGGCGCTAGATACTTATCCATCACTTGCCAGTCACCTGAAGAACCGTCTGCTACTTTCGTTGTCAATCCCCTGAATATGTCAGAAAGGTTTATCTGGGAGGATAATCGAATCGTCCGAGAGCGTCTTGTTAACAAAACTAGACCTCTTACCGTTAACACGACAGTGCCTTTGGAAAACGGGCATCCAGCACCTGTTAAACTGTTTTTGCCGCCTGGACTCAATATATCCGACACCAGCGTCAAGTACCCCATGGTGTTCTATGTGTACTCAGGACCCAACACTAACACGGTGTTCGATACCTTCACTGTTGGTTACCACTCCTACCTAACAACTAGTCGCAACACCATCTACATGATGGCTGACCGGCGTGGTGCTGGACTTCATGGTCAGGATATGCTCTTTTCCTTGAACAACGCTCTCGGCACAGTCGAGATCGAAGATCACTTCGTCGTGCTGAGACAAGTTCTGGAGTTGTACAAATTCATAGACCCAGAAAGAGTCGCTATTTGGGGCCACAGCTATGGCGGATACGCGACACTACTCACCTTGGTTCATGATGACGAGAATATGTTCCAATGTGGAGTCTCAACTGCACCTGTCACCTCCTGGCTTTATTATAATTCTATGTATACCGAGAGGTACATGGGTTTGCCGACGGTTGAGGATAATTTGGCGGGATACCAAGCAGGTGACGTCACCTTGTGGGCTGAGAAACTCCGCGG
GAAGAAGCTATACCTGATGCATGGAAACGCCGATGATAATGTACACTATCAAAACGCTGCCAAGTTGATGAGAGCGTTGCAGCAACTTAATATACCTTTTGATCAAATGTCTTATCCCGATGAAGCCCACAGCTTATCTAACGTCAACATGCACAGATACGGAACAATGAACAAATATTGGGAGCAGTGTATACAAATGCCGGCTGAGATATCCTAG

Protein sequence:

>DPOGS214131-PA
MQIWLVLACVLAAAIGQEPFTLEEFVRGRFSQRGFNGTWISDNEFTYTISGEPGIYMYNAETRNSTVLVSGELMGFLNTSNPILSADRRYILAPSEVQQVYRYSTTAKFALYEIATTNVTNIANHARLQLCLFGSGHSLAFVLDNNVYYLPENSTTAIQLTSDGIPGVIYNGHTDWVYEEDVMYTGVATWFSTRGTYLAFASYNDTLVETYSYYHFVDKSDPDDVYPELIDLKYPKVGRTNPTVKLRVVDLRTVATNLTYITLDAPEEVTSDHILGGVTWTTDTEIAVHWLNRRQNYTILRIYENRSQSNGWVPIALPSFSENGDFYVSTRWSLRQADGNIWQHLYLSVRVNDEIISSSITPGAHTVNNFIGMDETNLAYYYTRTVTDAPWQTQVHVAGARVACLSCVLQMPDGGPCTWVTSTASLGARYLSITCQSPEEPSATFVVNPLNMSERFIWEDNRIVRERLVNKTRPLTVNTTVPLENGHPAPVKLFLPPGLNISDTSVKYPMVFYVYSGPNTNTVFDTFTVGYHSYLTTSRNTIYMMADRRGAGLHGQDMLFSLNNALGTVEIEDHFVVLRQVLELYKFIDPERVAIWGHSYGGYATLLTLVHDDENMFQCGVSTAPVTSWLYYNSMYTERYMGLPTVEDNLAGYQAGDVTLWAEKLRGKKLYLMHGNADDNVHYQNAAKLMRALQQLNIPFDQMSYPDEAHSLSNVNMHRYGTMNKYWEQCIQMPAEIS-
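
The listed lengths (2217 bp transcript, 738 aa protein) are consistent with each other if the transcript is a complete coding sequence that starts at ATG and ends with a stop codon: 2217 / 3 = 739 codons, one of which is the stop. The sketch below is not part of the record. It assumes the standard genetic code, and the function names `translate` and `expected_protein_length` are illustrative only. It checks this arithmetic and translates the first and last codons of DPOGS214131-TA, writing the stop as "-" to match the protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon; a trailing partial codon is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))
    return protein.replace("*", stop_symbol)


def expected_protein_length(cds_length_bp):
    """Residue count for a complete CDS whose last codon is a stop codon."""
    return cds_length_bp // 3 - 1


# First and last codons copied from the nucleotide record above.
print(translate("ATGCAAATTTGGTTGGTA"))   # MQIWLV
print(translate("GAGATATCCTAG"))         # EIS-
print(expected_protein_length(2217))     # 738
```

To compare the whole record, pass the full nucleotide sequence to `translate` with its line breaks removed and compare the result with the DPOGS214131-PA entry.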