Monarch geneset OGS2.0

DPOGS213621
Transcript: DPOGS213621-TA, 1428 bp
Protein: DPOGS213621-PA, 475 aa
Genomic position: DPSCF300033 + 1027166-1039536
RNAseq coverage: 110x (Rank: top 59%)
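The lengths in this record can be cross-checked with a little arithmetic. The sketch below is only illustrative. It assumes the 1428 bp transcript is coding sequence from start codon to stop codon (the nucleotide sequence in this record does begin with ATG and end with TGA) and that the genomic coordinates are inclusive; the function names are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the record; everything else is an illustrative assumption.
TRANSCRIPT_BP = 1428
PROTEIN_AA = 475
GENOMIC_START, GENOMIC_END = 1027166, 1039536


def codon_count(transcript_bp: int) -> int:
    """Number of codons, assuming the transcript is an uninterrupted reading frame."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3


def inclusive_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming both end coordinates are included."""
    return end - start + 1


codons = codon_count(TRANSCRIPT_BP)
# One codon more than the protein length is what a terminal stop codon would account for.
print("codons:", codons, "codons minus residues:", codons - PROTEIN_AA)
print("genomic span (bp):", inclusive_span(GENOMIC_START, GENOMIC_END))
```

Under these assumptions the genomic span is considerably longer than the transcript, which would be consistent with the gene containing introns.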
Annotation
Heliconius: HMEL007784; E-value 0.0; 91.76%
Bombyx: BGIBMGA011789-TA; E-value 0.0; 92.63%
Drosophila: CG42750-PB; E-value 2e-129; 55.16%
EBI UniRef50: UniRef50_Q9VJ94; E-value 4e-127; 54.75%; CG42400 n=20 Tax=Endopterygota RepID=Q9VJ94_DROME
NCBI RefSeq: XP_396377.2; E-value 7e-151; 56.06%; PREDICTED: similar to CG6154-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|328779156; E-value 3e-150; 56.47%; PREDICTED: dipeptidase 1-like [Apis mellifera]
NCBI nr blastx: gi|307202655; E-value 2e-151; 57.08%; Dipeptidase 1 [Harpegnathos saltator]
Group:
Gene Ontology:
  GO:0006508; E-value 4.1e-181; proteolysis
  GO:0016805; E-value 4.1e-181; dipeptidase activity
  GO:0008239; E-value 4.1e-181; dipeptidyl-peptidase activity
  GO:0008235; E-value 4.1e-181; metalloexopeptidase activity
KEGG pathway:
InterPro domain: [64-462] IPR008257; E-value 4.1e-181; Peptidase M19, renal dipeptidase
Orthology group: MCL18905, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213621-TA
ATGGGCTACGGGAACTACATGGACTACCAAGTGGCACCCGCTCACGCCTGCCTGTGCCGCGCGATCACGTCTCCTCACGCCCACGCGGCTGTACCAGAGGGCATGACATTTCCTGGTTCGTTACCAGATGTGGCAGAAAGATGCGGTCGCTGGGCGCCAGATTCATTATCAGCATCCGCATCTGGGAGCTCTGACGAAAAGCCAACGCCGCGTCGACCGCGGTGGCGTATCGCACTCGCTGCTCTCGTTGTATTGGCCGCACTCGGCGCTGCACTAGCCCTCCCCCTCGCCCTGGGAGGTGGCGGTCGTGCTACCCCTGAGCAGCGATTAACAACTATACGGAGAATGTTACGTGATTCTCCACTTATAGACGGTCATAATGACCTAGCGTGGAATGTTAGAAAATTTCTTCATAATAAAATAGGTGACTTTAATTTGAGTGCTGGCTTGGAGGGCTTAGAGCCGTGGGCCCGATCGCGCTGGTCGCACACGGATATTCCGCGGCTACGACTCGGTCAGATTGGGGCACAGTTTTGGTCAGCATACGTTCCATGTGGTGCGCGAGATAAAGACGCTGTGCAGTTGGCCATTGAACAAATGGATGTCATCAAACGGATAGTTGATATGAACGCAGCTCATCTTGCTTTAGTTACCGGCGCGTCGGACCTCTTAGATGCCCATCGAGATGGTCGGATTGCTTCTTTGATTGGAGTTGAAGGTGGCCATGCACTCGGTGATTCCCTAGCGGTGCTTCGCGCGTTTTATAACCTAGGTGCTCGATATCTTACTGTTACTCATACATGTGATACGCGATGGGCGCGTGCCGCTGGCACCTCTGGTGGGCTCACTGAGTTCGGACGCGCCGTTGTTCGAGAAATGAATCGTCTTGGTATGATAGTTGATCTGTCGCATGCAGGTGAAGAGACAGCCCGCGACGCTCTTGAAACTTCACAGGCACCCGTAGTATTCTCTCATTCTGGAGCTGCAGCAATATGTAATTCGTCTAGAAATGTGCCAGATGATCTGCTTCGCATGATCGCTGCGAATGGTGGCGTAGTTATGATTAATTTCTATGCTAAACTTGTAACATGCAGCGAGCGAGCGACAATCGAAGATGTTATTGCACACATAAACCACGTGCGAAGGGTAGCCGGAGTGGAGCACGTAGGTCTGGGAGCTGGTTATGATGGTATAGACGCACCGCCCGTGGGTTTAGAGGATGTTTCACGTTACCCCCACCTGTTAGCCGAGTTACTTCGCGATCCGGATTGGAGTGAAGAAGACGTTCGTAAGCTGGCCGGTATGAATGTTGTCCGCGTACTGCAGCACGTGGAGCGCGTGCGAGATCAATGGAAACGTGCCGCCGTTTTTCCTGGCGAAGAAACACCTGGTGCGCGACGCAGCGAGTGCGTGTACGGAACCGCGTGA

Protein sequence:

>DPOGS213621-PA
MGYGNYMDYQVAPAHACLCRAITSPHAHAAVPEGMTFPGSLPDVAERCGRWAPDSLSASASGSSDEKPTPRRPRWRIALAALVVLAALGAALALPLALGGGGRATPEQRLTTIRRMLRDSPLIDGHNDLAWNVRKFLHNKIGDFNLSAGLEGLEPWARSRWSHTDIPRLRLGQIGAQFWSAYVPCGARDKDAVQLAIEQMDVIKRIVDMNAAHLALVTGASDLLDAHRDGRIASLIGVEGGHALGDSLAVLRAFYNLGARYLTVTHTCDTRWARAAGTSGGLTEFGRAVVREMNRLGMIVDLSHAGEETARDALETSQAPVVFSHSGAAAICNSSRNVPDDLLRMIAANGGVVMINFYAKLVTCSERATIEDVIAHINHVRRVAGVEHVGLGAGYDGIDAPPVGLEDVSRYPHLLAELLRDPDWSEEDVRKLAGMNVVRVLQHVERVRDQWKRAAVFPGEETPGARRSECVYGTA-
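
The relationship between the two sequences can be spot-checked by translating the ends of the transcript. The following is a minimal sketch, assuming the standard genetic code and reading frame 1. The `translate` helper is hypothetical, and the two nucleotide fragments are copied from the start and end of DPOGS213621-TA. The stop codon is rendered as `-` to match the trailing `-` in the protein record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str, stop_symbol: str = "-") -> str:
    """Translate a nucleotide string in frame 1; any incomplete trailing codon is ignored."""
    nt = nt.upper()
    residues = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 and last 21 nucleotides of DPOGS213621-TA, copied from the record.
head = "ATGGGCTACGGGAACTACATGGACTACCAA"
tail = "TGCGTGTACGGAACCGCGTGA"

print(translate(head))  # compare with the first 10 residues of DPOGS213621-PA
print(translate(tail))  # compare with the last 7 characters of DPOGS213621-PA
```

Passing the full transcript to the same function should allow the whole protein record to be compared in the same way.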