Monarch geneset OGS2.0

DPOGS206546
Transcript	DPOGS206546-TA	2649 bp
Protein	DPOGS206546-PA	882 aa
Genomic position	DPSCF300190 + 164251-175847
RNAseq coverage	3x (Rank: top 90%)
Annotation
Heliconius	HMEL002280	0.0	76.62%
Bombyx	BGIBMGA005912-TA	9e-146	61.57%
Drosophila	CG42750-PB	6e-164	64.07%
EBI UniRef50	UniRef50_D6WLX6	7e-167	67.64%	Putative uncharacterized protein n=4 Tax=Tribolium castaneum RepID=D6WLX6_TRICA
NCBI RefSeq	XP_974577.1	4e-167	59.16%	PREDICTED: similar to CG31746 CG31746-PA [Tribolium castaneum]
NCBI nr blastp	gi|270007735	2e-166	67.64%	hypothetical protein TcasGA2_TC014432 [Tribolium castaneum]
NCBI nr blastx	gi|91083295	3e-165	60.48%	PREDICTED: similar to CG31746 CG31746-PA [Tribolium castaneum]
Group
Gene Ontology	GO:0006508	2.7e-202	proteolysis
	GO:0016805	2.7e-202	dipeptidase activity
	GO:0008239	2.7e-202	dipeptidyl-peptidase activity
	GO:0008235	2.7e-202	metalloexopeptidase activity
KEGG pathway
InterPro domain	[481-870]	IPR008257	2.7e-202	Peptidase M19, renal dipeptidase
Orthology group	MCL25184	Lepidoptera specific

Nucleotide sequence:

>DPOGS206546-TA
ATGATTCAAGTGGATGGTTTCAAACGCTCGTCTCCGTCACCATCGCGATGCTCAGTTCGGAGTGACAGTCGGGTCTTCAGACAGCCTCGCGTCATTCAAGTTAACCGAGACGCAGCTACAATGATGGACGATGAAGCCAGTGTCAGCGGCATGGGGAATGTCACTGAACCTCCTTGCTCCTCAGATTTCGAATCTAACGTTAATCAGTCAACACATTTTCGATCACGGACATTTAGTGACGGCTTCATTAGAAACACAAGCCTCGAATTGTTACTTGGTGTTGATTGCGAGGCCTGTCTGGCGCTCGCTGCACAACAACATTGCATCAATGAACCTATAACGGAACAGGATTTTGATTTGGTATGCAATTGTAATGTTCGCCAACATAATTACTTGAACTATTACCGTGACTCCGTCAAACAGAAGGGGCATACTCTGAATATTTATGATATAGAAAGAATTAAGAGAGACACGGGAAGAAGTAGTCTAGCTGTCGGTTCACATCGTAGTCTTGATAGAAAACATTACAAAAGTAAAACCATAGGCGTTTCAAGCTACATATCCAAGCAGTTAATATCATATGACCCGACTAATACACTTGTACAAAAACTCAAAAGAAAATTGTCTAATCTGAAAAAAGTGAGGGGTGAAAATATTGTGAAAAAACCACATAATGGCCTTCAACTTGCTGAAATGGCATCCACTTCTACTAGCTCCCTAAAAAGGCTATCAATGTTATCACTTGTTGATAAATCGCAAAAAAAAATGGACAATCAAAATTGCATTTTTTCTTCAAACGGTGTGCAATCGAAAGATGATTTCAAAAACAATGTTACCACATATGAAAATTTTTCTTCCAATATTGACCAATGCTATGACCGGCTTCGTACATCCAACATCGATACATCACAAAAGAACGCCAAAGGAAGAAATTTTGATATCAACGACGAACTAATTCCGAGGAACCCGCCTTTGGTGAAATATTCAACACTTGATAATCGAAGCAGGTGTCGAAGTATAAAGAGACAGCTCTCGTGTAACGATTTTCCCAGCGACAGGTGTAGGGAACGTGTGCCCGAGACATGGTCGCAAGACGAGAACTCTTTTGGTTTGAGAAATTCGAGGAATGAAGTAGATGACGAACAGACATTCATAGAAGCTCACAAAGTATTAAGAGGTGCTCCAAGAAGTGCTCATGCACATTGTACCTGTAACTCCATTCAAGACAATAGAATACCGTCTATATTTTATGACACCCATGGTAAAAAGCGTTCAGCCCCCGCCCCTGACGTTACGGATCGCTCCCAGGGAAGATCTTGCAGCGACTGCAAACCTGCACAGTGGTACCCCCCGAGCACATCCGCGACCAGCCACAGTTCTACGGTTACTGCTAATTCATCACGTCTGACAGCATGGCACCGTCGATGGTGTTGCGTGGCTATACTTGTGTTGGTAGCGGGCACAGCCTGCGTTGCCGGACCGCTGGCTCTTAGGGCTCCACCTGGTGCACCCCTACACGAAAGACTCCGCCTCGCAGAAAGACTACTGCACGACACACCACTTATTGATGGACATAATGATTTACCTTGGAATATACGAAAATTTTTACACAATAAAATCAAAGACTTTAGATTTGATGAAGACCTTCGAACTATATCTCCCTGGGCTACGAGCTCGTGGAGTCATACAGATTTACTTCGTCTTAAGCATGGAAGAGTAGCCGCTCAGTTTTGGGCCGCATACGTGCCTTGCGACGCGCAACATCGGGATGCAGTGCAATTGACCTTCGAACAAATAGACCTAATCCAGAGACTCACAGACAAGTACCATCCACAACTAACATTCTGTACCTCTGCCGACGATATATTATCGGCTCACGTAAACCACCGGCTGTGCTCACTGGTGGGTGTGGAAGGTGGGCATGCAATTGGAGGTTCCTTAGGTGTACTAAGGACGTTGTATCAAGTTGGAGTTCGGTATCTAACTCTAACTTCGACTTG
CGATACGCCTTGGGCTGAATGTGCTTCCACCGATCGACCTGAATCCGCACAAAGGGGAGGATTAACGCCTTTTGGTAAAGTGGTGGTTAAAGAAATGAATAGATTGGGCATGCTGGTTGATCTATCACATGTTTCTGAGCGAACCATGCGGGATGCCCTTTCGGTTTCACGAGCGCCAGTGCTTTTCTCACATTCCTCGGCCCGAGCGCTTTGTAACGTAACTCGAAATGTACCAGACAGCGTGCTTCGACTCTTAGCAGCTAATAAAGGACTGATAATGGTCAACTTCTACACTTCTTTTCTCACTTGTAGAGATACGGCTACCGTTCAGGATGCTATAGAACACATAAACCATATCCGCGACATCGCTGGTGTCGACAGCGTTGGCTTAGGAGCAGGATACGATGGAATAAATTACACACCTCATGGGTTAGAAGATGTCTCGTCATATCCATTATTATTTGCTGAACTGATGGAAGACGGATGGAGCATAGAAGATTTGAAAAAATTGGCTGGCCTGAATTTATTACGTGTAATGAACGCAGCAGAACGTGTATCTAGAGAATTATCATCAGCCCATGTCACTCCTTACGAAGAAGTTGGACCCAGAGTGTTAGACTCGCACAATTGTTCCAGTCAGGACGTTTAA

Protein sequence:

>DPOGS206546-PA
MIQVDGFKRSSPSPSRCSVRSDSRVFRQPRVIQVNRDAATMMDDEASVSGMGNVTEPPCSSDFESNVNQSTHFRSRTFSDGFIRNTSLELLLGVDCEACLALAAQQHCINEPITEQDFDLVCNCNVRQHNYLNYYRDSVKQKGHTLNIYDIERIKRDTGRSSLAVGSHRSLDRKHYKSKTIGVSSYISKQLISYDPTNTLVQKLKRKLSNLKKVRGENIVKKPHNGLQLAEMASTSTSSLKRLSMLSLVDKSQKKMDNQNCIFSSNGVQSKDDFKNNVTTYENFSSNIDQCYDRLRTSNIDTSQKNAKGRNFDINDELIPRNPPLVKYSTLDNRSRCRSIKRQLSCNDFPSDRCRERVPETWSQDENSFGLRNSRNEVDDEQTFIEAHKVLRGAPRSAHAHCTCNSIQDNRIPSIFYDTHGKKRSAPAPDVTDRSQGRSCSDCKPAQWYPPSTSATSHSSTVTANSSRLTAWHRRWCCVAILVLVAGTACVAGPLALRAPPGAPLHERLRLAERLLHDTPLIDGHNDLPWNIRKFLHNKIKDFRFDEDLRTISPWATSSWSHTDLLRLKHGRVAAQFWAAYVPCDAQHRDAVQLTFEQIDLIQRLTDKYHPQLTFCTSADDILSAHVNHRLCSLVGVEGGHAIGGSLGVLRTLYQVGVRYLTLTSTCDTPWAECASTDRPESAQRGGLTPFGKVVVKEMNRLGMLVDLSHVSERTMRDALSVSRAPVLFSHSSARALCNVTRNVPDSVLRLLAANKGLIMVNFYTSFLTCRDTATVQDAIEHINHIRDIAGVDSVGLGAGYDGINYTPHGLEDVSSYPLLFAELMEDGWSIEDLKKLAGLNLLRVMNAAERVSRELSSAHVTPYEEVGPRVLDSHNCSSQDV-