Monarch geneset OGS2.0

DPOGS204581
TranscriptDPOGS204581-TA1539 bp
ProteinDPOGS204581-PA512 aa
Genomic positionDPSCF300376 + 83537-89829
RNAseq coverage592x (Rank: top 21%)
Annotation
Heliconius% 
BombyxBGIBMGA001847-TA0.073.37% 
DrosophilaCG4972-PA2e-7836.97% 
EBI UniRef50UniRef50_F4X3638e-12748.25%Nicalin n=8 Tax=Neoptera RepID=F4X363_ACREC
NCBI RefSeqXP_001604957.12e-13848.46%PREDICTED: similar to Nicalin [Nasonia vitripennis]
NCBI nr blastpgi|3407172222e-13849.03%PREDICTED: nicalin-1-like [Bombus terrestris]
NCBI nr blastxgi|3407172223e-13548.94%PREDICTED: nicalin-1-like [Bombus terrestris]
Group
Gene OntologyGO:00082331.8e-07peptidase activity
GO:00065081.8e-07proteolysis
KEGG pathway 
InterPro domain[183-240] IPR0074841.8e-07Peptidase M28
Orthology groupMCL15387 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204581-TA
ATGTCCCCTGTGAACCCTGTTGCCGCATCCCATGAATTTTCGGTATACAGAATGCAACAATATGATCTACACAATGTTCCCCACGGTTGCCGAAGCTCCAGTTTCAACTTGGAAGGCCGTTCCTTGAGCTCGTGGGGAACCTCTCGTCATTGCATTGTCGCACGCCTTCAAGATATCTCAGTGGACCAGTTTGTAGAAATAAGAAATAAGGCTGGTGCCCTTGTATTGGTCTTACCAAAGAACGTCACCTTACTCACTGATGAAGAAAAAGAGCACATGACACTCCTAGAGATGGCAATGATGCAGCAAGAGATCAACATTCCCGTGTACTTTGCGAACTGGTCACCGGAGTTTGAAGACATCATCGCCGATCTCCAACATAGCTTCATCACCGATGATAAATCCGGAACGGCTCTAGAGGCCATGTTCAATACCGTGTCCTCAAACGGATACCAGATAGTCGTATCAACGTCGTCTCCTCACAAGCTGGAATCCAAGCCGGTCACTCTTCACGGCAAGCTGTTTGGCCGAGCTGGGAACACTCAGACCATAGTCATAGCAGCTCACTACGACGCCAACAGCTTAGTACCGGAGTTGTCAACCGGTGCTGATTCCAACGCGTCCGGTGTCGCGGCTCTGTTAGAGTTGGCGCGAATCTTCTCTCGCCTCTATTCGGTCAGCTCTGACAGAGGCGGCCCTTCGATATTGTTCGTCCTCACCTCTACCGGACACTCTCTCAACTATTTCTCTACTAAGAAGTGGCTCGAAGAACAGTTGGACTCCACTGATGCGACTCTCTTACAGGACGTGTCGTTCATCATGTGCCTGGACTCTATATCGTCTGGGCCTCTGTCAATGCACGTGTCGAAGCCTCCCAAGCCGTGGACGCCGGCCAACTCCATAAAGTCCCGGCTTCGAGTCCCGGTCGTTCATAAGAAGATCAATTTGGCCGACGAGACCCTCGCCTGGCATCACGAGAGGTTCAGCATCAAGCGCATGACGGCCTTCACTGTTAGCTCGCTGCAGAGTCACAGGGATCCATCTCGCAGCTCAATCCTGGACACTGCCAGTGAGGGCTCACTGAACACCCTGTATACAAACGTGGTGGAAATATCCCGGGCCCTCGCCTCCCACGTGTACAACCTGACAGACGAACACATTCAAACGCATCTATACGACGAGGCCCTGGGTGTTGACATGGAATCCATCAAGCTGTGGTACAACTATTTGTCCTCTCAGCCGCGAGCGCCTCACATCGTCACGACCAGTGTGACCGGAGCCTTAGAGCGAGCGTTGGCGAAGTACGTGGAGGTGTCCGTCAGTGTGCACGCGACCGACCGCCGTGAGCCCGAACACGCGCTCTACGGACCCACCAGGGCGCTGTTATATGTATACAGTGTGAAACCGGCTGTATTCGACCTTATCCTGACCCTAGCGATCGTGGCCTACCTCACAGTTTTGTATTTCGCTATGCAAGTGTTCCCACGTTTCTATGAAGAATACGCGAAAATAGTGACAGGGAAAGCCAAGGTTCATTAA

Protein sequence:

>DPOGS204581-PA
MSPVNPVAASHEFSVYRMQQYDLHNVPHGCRSSSFNLEGRSLSSWGTSRHCIVARLQDISVDQFVEIRNKAGALVLVLPKNVTLLTDEEKEHMTLLEMAMMQQEINIPVYFANWSPEFEDIIADLQHSFITDDKSGTALEAMFNTVSSNGYQIVVSTSSPHKLESKPVTLHGKLFGRAGNTQTIVIAAHYDANSLVPELSTGADSNASGVAALLELARIFSRLYSVSSDRGGPSILFVLTSTGHSLNYFSTKKWLEEQLDSTDATLLQDVSFIMCLDSISSGPLSMHVSKPPKPWTPANSIKSRLRVPVVHKKINLADETLAWHHERFSIKRMTAFTVSSLQSHRDPSRSSILDTASEGSLNTLYTNVVEISRALASHVYNLTDEHIQTHLYDEALGVDMESIKLWYNYLSSQPRAPHIVTTSVTGALERALAKYVEVSVSVHATDRREPEHALYGPTRALLYVYSVKPAVFDLILTLAIVAYLTVLYFAMQVFPRFYEEYAKIVTGKAKVH-