Monarch geneset OGS2.0

DPOGS208018
TranscriptDPOGS208018-TA5712 bp
ProteinDPOGS208018-PA1903 aa
Genomic positionDPSCF300203 - 350504-356215
RNAseq coverage1297x (Rank: top 10%)
Annotation
HeliconiusHMEL0121382e-12345.02% 
BombyxBGIBMGA001504-TA2e-5445.98% 
DrosophilaMuc68D-PB1e-1627.75% 
EBI UniRef50UniRef50_Q86BV00.045.76%Peritrophin 1 n=2 Tax=Noctuidae RepID=Q86BV0_9NEOP
NCBI RefSeqNP_001161929.11e-9927.87%peritrophic matrix protein 14 [Tribolium castaneum]
NCBI nr blastpgi|306921030.045.76%peritrophin 1 [Mamestra configurata]
NCBI nr blastxgi|306921030.045.53%peritrophin 1 [Mamestra configurata]
Group
Gene OntologyGO:00080619.1e-20chitin binding
GO:00060309.1e-20chitin metabolic process
GO:00055769.1e-20extracellular region
KEGG pathwaytca:6625044e-20 
 K01873 (VARS, valS)maps-> Aminoacyl-tRNA biosynthesis
    Valine, leucine and isoleucine biosynthesis
InterPro domain[1671-1743] IPR0025579.1e-20Chitin binding domain
Orthology groupMCL11232 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208018-TA
ATGAATTGCCCGGAAGGTTTGTTATATAATCCTTATGAGAGCTTATGCGATTATCCCAGCAATGTTAAGTGTGGTGACCGAGTCATTCCAAGCCCAGATGAAAATAAGCCAGGCGATGAAAACGATAACAGCGATAACAACAACGGTAATGAGGATAATAACGGAGGGGACAACAATGGCCCTTGTAATTGTGTTCCAGACGAAGCGCCTGCTATTTGCGGTAAAGCTGGATCCGATGGAATACTCATCGCACATGAACATTGTGATAAGTTTTACAAATGCTCTCACGGCAAGAACGTTTCCATGAGCTGTCCTTCAGGTTTGTTATACAATCCTTACAAAGGATGGTGCGACTATCCAAGCAATGTTAAGTGTGGAGATCGAGTCATTCCAAATCCAAAAGAAGACAAACCAGATAGCGAAGATGGAGACTGTAATGACGACAATGGCAATGACGATAATAACGGAGGGGACAACAATGGCCCTTGTAATTGTGTTCCAGACGAAGCGCCTGCTATTTGCGGTAAAGCTGGATCCGATGGAATACTCATCGCACATGAACATTGTGATAAGTTTTACAAATGCTCTCACGGCAAGAACGTTTCCATGAGCTGTCCTTCAGGTTTGTTATACAATCCTTACAAAGGATGGTGCGACTATCCAAGCAATGTTAAGTGTGGAGATCGAGTCATTCCAAATCCAAAAGAAGACAAACCAGATAGCGAAGATGGAGACTGTAATGACGACAATGGCAATGACGATAATAACGGAGGGGACAACAATGGCCCTTGTAATTGTGTTCCAGACGAAGCGCCTGCTATTTGCGGTAAAGCTGGATCCGATGGAATACTCATCGCACATGAACATTGTGATAAGTTTTACAAATGCTCTCACGGCAAGAACGTTTCCATGAGCTGTCCTTCAGGTTTGTTATACAATCCTTACAAAGGATGGTGCGACTATCCAAGCAATGTTAAGTGTGGAGATCGAGTCATTCCAAATCCAAAAGAAGACAAACCAGATAGCGAAGATGGAGACTGTAATGACGACAATGGCAATGACGATAATAACGGAGGGGACAACAATGGCCCTTGTAATTGTGTTCCAGACGAAGCGCCTGCTATTTGCGGTAAAGCTGGATCCGATGGAATACTCATCGCACATGAACATTGTGATAAGTTTTACAAATGCTCTCACGGCAAGAACGTTTCCATGAGCTGTCCTTCAGGTTTGTTATACAATCCTTACAAAGGATGGTGCGACTATCCAAGCAATGTTAAGTGTGGAGATCGAGTCATTCCAAATCCAAAAGAAGACAAACCAGATAGCGAAGATGGAGACTGTAATGACGACAATGGCAATGACGATAATAACGGAGGGGACAACAATGGCCCTTGTAATTGTGTTCCAGACGAAGCGCCTGCTATTTGCGGTAAAGCTGGATCCGATGGAATACTCATCGCACATGAACATTGTGATAAGTTTTACAAATGCTCTCACGGCAAGAACGTTTCCATGAGCTGTCCTTCAGGTTTGTTATACAATCCTTACAAAGGATGGTGCGACTATCCAAGCAACGTTGAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGTGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGATCGAGTTATTCCAGACCCAGAAGAAGACAAGCCAGATAACGAAGATGGTGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGACCGAGTTATTCCAAACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCTCCTGCCATTTGTGGCCAACCTGGATCTAATGGGATTCTTATCGCACATGAACATTGTAACAAATTCTACCAATGCTCCAACGGCAAGAACGTAACCATGAACTGTCCGGCTGGTTTGTTATACAATCCTTACAAAAAATTGTGCGACTATCCAAGCAAAGTTAAGTGTGGTGATCGAGTTATTCCAGACCCAGAAGAAGACAAGCCAGATAACGAAGATGGAGACTGTGATGATGACAATGGCAATGACGATAACAACGGAGGTGACGACAATGGACCTTGCAACTGTGATCCGGATAAAGCACCTGCCATTTGTGGTCAACCTGGATCTGATGGGATTCTTGTAGCACATGAACATTGTGACAAGTTCTACAAATGCTCTAACGGCAAAAACGTTTCCATGAATTGCCCGGCAGGTTTGTTATACAATCCTTACAAAGGATGGTGCGACTATCCCAGCAACGTTGAGTGTGGTGACCGGGTTATTCCAAAACCAGAAGAAGACAAGCCAGATAATGAGGATGGAGATTGTGAAGACGACAATAACGGAGGTGACAATGATGGACCCTGTAATTGTATTCCAGATGAAGCTCCCATTATTTGTGCTAGACAAGGTTCCCATGGAATACTTATAGCACACAAACATTGTAATAAGTTCTACGTATGTGTCAACGGCAGAAATGTAACTATGAATTGTCCTGCTGGATTATTCTATAATCCCTATAGAGAGGTGTGTGATTTTCCTAACAATGTTAAGTGTGGAGACCGTATCATTGTGGACCCTGAAGATGAAAAGCCAGAGGAAGAGGAAGATAACGGAAATGATGATAACAATGGAGAAGATGGGAATGGACCTTGCAATTGTAATCCAGGTGAAGCACCCGCTATTTGCGCTAGAGCCGGATCCGATGGAGTATTTATTGCACATGAACATTGCAACAAGTTCTATCAATGCGCTCATGGTAGAAACGTGACCATAAGTTGCCCGGCAGGATTGTTATATAATCCTTATAAGAAGAGGTGTGATTACCCGGATAATGTTAAGTGTAATGACCGAATTATTCCAGATCCAGAAGAACCAGATAACGAAAACGAAGATGATAACAATGGAGGTGATAATGATGGATCTTGTAATTGTGTTCCAGAGGAAGCGCCTGAAATATGCGCAAAGATTGGGTCTGATGGAACACTTATTGCACATCAATATTGTGACAAATATTACGCCTGTATGCACGGCAGGAATGTGACAATGCGTTGTCCTGCAGGTTTATTATATAATCCGTACAGACAATGGTGCGATTACCCGAATAATGTAAAGTGTGGCGACCGCATAAACCCAGAACAAGACAGGGATACTTGCAATTGCAGTCTATCTTATGCTTTAACGACTTGCGAAAATGAAAACACAAATGGAAAAATCATTGCTCATGAAATATGTGATCGGTTTTATTCTTGCTCTAATAAGGAGCCGGTTGAGTTATTATGTCCTGAGGGACTACTTTTTGATGCAAAGAAACAAATTTGTGACTGGCCAAGCAATGTTGACTGTGGAGAAAGGATTCAATGA

Protein sequence:

>DPOGS208018-PA
MNCPEGLLYNPYESLCDYPSNVKCGDRVIPSPDENKPGDENDNSDNNNGNEDNNGGDNNGPCNCVPDEAPAICGKAGSDGILIAHEHCDKFYKCSHGKNVSMSCPSGLLYNPYKGWCDYPSNVKCGDRVIPNPKEDKPDSEDGDCNDDNGNDDNNGGDNNGPCNCVPDEAPAICGKAGSDGILIAHEHCDKFYKCSHGKNVSMSCPSGLLYNPYKGWCDYPSNVKCGDRVIPNPKEDKPDSEDGDCNDDNGNDDNNGGDNNGPCNCVPDEAPAICGKAGSDGILIAHEHCDKFYKCSHGKNVSMSCPSGLLYNPYKGWCDYPSNVKCGDRVIPNPKEDKPDSEDGDCNDDNGNDDNNGGDNNGPCNCVPDEAPAICGKAGSDGILIAHEHCDKFYKCSHGKNVSMSCPSGLLYNPYKGWCDYPSNVKCGDRVIPNPKEDKPDSEDGDCNDDNGNDDNNGGDNNGPCNCVPDEAPAICGKAGSDGILIAHEHCDKFYKCSHGKNVSMSCPSGLLYNPYKGWCDYPSNVECGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPDPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPNPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSNGILIAHEHCNKFYQCSNGKNVTMNCPAGLLYNPYKKLCDYPSKVKCGDRVIPDPEEDKPDNEDGDCDDDNGNDDNNGGDDNGPCNCDPDKAPAICGQPGSDGILVAHEHCDKFYKCSNGKNVSMNCPAGLLYNPYKGWCDYPSNVECGDRVIPKPEEDKPDNEDGDCEDDNNGGDNDGPCNCIPDEAPIICARQGSHGILIAHKHCNKFYVCVNGRNVTMNCPAGLFYNPYREVCDFPNNVKCGDRIIVDPEDEKPEEEEDNGNDDNNGEDGNGPCNCNPGEAPAICARAGSDGVFIAHEHCNKFYQCAHGRNVTISCPAGLLYNPYKKRCDYPDNVKCNDRIIPDPEEPDNENEDDNNGGDNDGSCNCVPEEAPEICAKIGSDGTLIAHQYCDKYYACMHGRNVTMRCPAGLLYNPYRQWCDYPNNVKCGDRINPEQDRDTCNCSLSYALTTCENENTNGKIIAHEICDRFYSCSNKEPVELLCPEGLLFDAKKQICDWPSNVDCGERIQ-