Monarch geneset OGS2.0

DPOGS201363
Transcript | DPOGS201363-TA | 1365 bp
Protein | DPOGS201363-PA | 454 aa
Genomic position | DPSCF300083 - 293075-295993
RNAseq coverage | 331x (Rank: top 35%)
Annotation
Heliconius | HMEL002142 | 0.0 | 77.16%
Bombyx | (none listed)
Drosophila | CG10098-PA | 1e-36 | 31.49%
EBI UniRef50 | UniRef50_Q0VDG4 | 3e-86 | 45.09% | Secernin-3 n=26 Tax=Euteleostomi RepID=SCRN3_HUMAN
NCBI RefSeq | XP_970316.2 | 9e-126 | 59.95% | PREDICTED: similar to GA18260-PA, partial [Tribolium castaneum]
NCBI nr blastp | gi|270007848 | 2e-125 | 60.45% | hypothetical protein TcasGA2_TC014587 [Tribolium castaneum]
NCBI nr blastx | gi|270007848 | 1e-121 | 60.35% | hypothetical protein TcasGA2_TC014587 [Tribolium castaneum]
Group
Gene Ontology | GO:0006508 | 1.4e-09 | proteolysis
Gene Ontology | GO:0016805 | 1.4e-09 | dipeptidase activity
KEGG pathway | (none listed)
InterPro domain | [62-284] IPR005322 | 1.4e-09 | Peptidase C69, dipeptidase A
Orthology group | MCL17484 | Patchy

Nucleotide sequence:

>DPOGS201363-TA
ATGACTCATAAAGTTTGTGTCACACATTATTCACATTTCAATTTGACATGTGACAATTATTATTTGGCATCAGGCATTGTGATTACGTTCCGACTTCTGAATTCACGCAAACAATTCGTTAGGAAAAATATTAAAAAATATTTACTATTCGCTATGACAACTATAATGGAAAAACCAAGATCATGTGATACTTTTGTTGTTCTTCCACCTCTGACAATAAACAATGTTGTAATATTTGGAAAAAATTCAGACCGTCCACAGAACGAAGTCCAAGAAGTGATATTGTCCCAAGATCGAACTCGCGACTCTAAATTAAAGTGCACTTATATAACAATAGATGAGTGCACAGATCCAATAAACAATGTGATATTAAGCAAGCCCGCTTGGATGTGGGGAGCTGAAATGGGTGCCAATGATAAAAATGTAGTGATAGGTAATGAGGCTGTGTGGACTAACAACAATGAAGGGGATGGAGATGCAAGACAGAAGCGTCTCCTGGGAATGGATTTGGTGCGATTAGGCCTTGAGAGGGGAAATACAGCCGAAAAAGCTCTTGATGTGATCACATCACTATTAGAGAAATATGGACAAGGCGGGCCCTGCTCTGAATATGATGACAGTCATTTCTATCATAACTCCTTTCTCATTGCTGATTACAAAGAGGCTTGGGTACTTGAGACTAGTGGGAAAATGTGGGCCGCAGAGAGAATTGATTCGGGATACAGAAATATTTCCAATGGACTTACCATTGGAACGAAAATCGATAAGCATTCTGAGGGGCTGTTTGAAAAAGCTGAGGCTATGGGACTCTGGGATGGGAAGGGTTTGTTCGACTTCAGCGCTGCATTTTCTTCGGGCGGAGACGAGCTCCGTCAGAAACAGGGTGAACGGCTTCTGAAACAAGCAACAGTGACTTCTGTTTTCGATGTTACTGATATGTTCCGTATACTGAGGCACAAAGAGAGTGGTATCTGCCGCGCATGTGACGACACCTTCCCCACCCAAGGAAGCCAGGTATCATCTCTATCGTCAGTTGGCATAAGCGTACATTGGTTTACAGCGACTCCAGACCCGAGCGTATCTTACTTCAAGCCGTTTGTTTTCACTCCTAACGCTAGAATCTCACCGTACACCGAAAGCCCGTCAGCGCCGAATCGCGAACACCATCTTTATAAGTTGCATTCGGCGCACGTTTTGAAGAATAACAACGAGAAAATGTCGAAAGTACTGGCTGATATCGAGAACGGATGCATCGCGGAAATAACAGATTTCATGAAGAAATACGAGCTTAAAGAGAAAAATATCAACGAACTCGACGACCTGATGAAGAAGTGCGTGGAGACAGAAGTCAAACTTTACGGTTAA

Protein sequence:

>DPOGS201363-PA
MTHKVCVTHYSHFNLTCDNYYLASGIVITFRLLNSRKQFVRKNIKKYLLFAMTTIMEKPRSCDTFVVLPPLTINNVVIFGKNSDRPQNEVQEVILSQDRTRDSKLKCTYITIDECTDPINNVILSKPAWMWGAEMGANDKNVVIGNEAVWTNNNEGDGDARQKRLLGMDLVRLGLERGNTAEKALDVITSLLEKYGQGGPCSEYDDSHFYHNSFLIADYKEAWVLETSGKMWAAERIDSGYRNISNGLTIGTKIDKHSEGLFEKAEAMGLWDGKGLFDFSAAFSSGGDELRQKQGERLLKQATVTSVFDVTDMFRILRHKESGICRACDDTFPTQGSQVSSLSSVGISVHWFTATPDPSVSYFKPFVFTPNARISPYTESPSAPNREHHLYKLHSAHVLKNNNEKMSKVLADIENGCIAEITDFMKKYELKEKNINELDDLMKKCVETEVKLYG-
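
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon; the names `translate` and `CODON_TABLE` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# "*" marks a stop codon in this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the last character of the protein record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First ten codons and last five codons of DPOGS201363-TA.
print(translate("ATGACTCATAAAGTTTGTGTCACACATTAT"))
print(translate("AAACTTTACGGTTAA"))

# Length consistency: 454 residues plus one stop codon, three bases each.
print(3 * (454 + 1) == 1365)
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein record would extend the same check to the whole entry.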