Monarch geneset OGS2.0

DPOGS209028
Transcript: DPOGS209028-TA (3867 bp)
Protein: DPOGS209028-PA (1288 aa)
Genomic position: DPSCF300102 - 284356-291137
RNAseq coverage: 21x (Rank: top 79%)
Annotation
Heliconius: HMEL005276; 0.0; 62.09%
Bombyx: BGIBMGA010029-TA; 0.0; 55.64%
Drosophila: CG14608-PC; 1e-36; 47.86%
EBI UniRef50: UniRef50_UPI0002061E65; 1e-42; 44.22%; UPI0002061E65 related cluster n=1 Tax=unknown RepID=UPI0002061E65
NCBI RefSeq: XP_001942936.1; 6e-43; 50.30%; PREDICTED: similar to CG14608 CG14608-PB, partial [Acyrthosiphon pisum]
NCBI nr blastp: gi|328714284; 4e-42; 44.22%; PREDICTED: hypothetical protein LOC100159478 [Acyrthosiphon pisum]
NCBI nr blastx: gi|322794841; 1e-58; 26.05%; hypothetical protein SINV_15483 [Solenopsis invicta]
Group
Gene Ontology: GO:0008061; 9.7e-14; chitin binding
Gene Ontology: GO:0006030; 9.7e-14; chitin metabolic process
Gene Ontology: GO:0005576; 9.7e-14; extracellular region
KEGG pathway:
InterPro domain: [99-160]; IPR002557; 9.7e-14; Chitin binding domain
Orthology group: MCL25682; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209028-TA
ATGATTTCCTTAAAAACAATCTGTGTTTTAGTCGCTTTAAATTTGGTGCAAACTTGTCACTGTGCTCGACAACTGGCGAATCGAAGAAAGGATACATCGTCCTTGTATCTTCCCGAACCAAGTGCGCAGTCACTGACAGCGATCGCACAAGCGATGGGAGCGGCGGGATTCGAAGACTACACGGAAGGAAAGACACTCGTCAAGCGACTGATAACGGCCGATGACCAGTCTGAGCTTGATGTTGTTGAACATATGGGTGTAATAGGAAAGGCAGGTGTTGACTTCCCAGCTCTGCCCAATATCCCCAAAACCGGATTCAACTGCAAGAACGTGCCCACGGGTTATTATGCTGACTTGGAAACCGATTGTCAGGTATTCCATATCTGTGACACGTCTCGCAAGATATCGTTCTTGTGTCCAAATGGCACCATCTTCAGTCAGTCGCATCTCATCTGTGACTGGTGGTTCAAGGTGGACTGTGCATCCGCACCGGCTCTGTACGAGGCTAGTGTAGAGTACTACTCCAATGAACAAAAAAAGTCTCAGAAAGTAGGAAGAACTCTCTCCAAAAACTCTAATAACCGAAATGTCGGTGCCGATTCTCAGGTTCGAACCGAATCTAGAAAAGCACCTATACCCTCGACAACAGAAAAACTTCTAAGACAGTCCCAAAATTCACAAACTAATGAACCTGTACCCACTGACGTTCCTACAACCAGAAACTATCAGACAATCTTTGATATTAATCCAACGTATGCTACCACTGAATTTGAAAACCGGAAAAAGAACCTCGTTCAGCTTATTTCAAATGATTTCAGCAACTATCCATCTAAAACAACTCTACCTGTATACGACTCAACAACCCGCAAAACTACTGAACCAGTATATGACCATAACAGCTTGAGGGAAATGCAAGTAGCTGCGGAAACAGCTTCTTTTGCCCAAAACCAAAATCGACAATTTTTACAAGAATATAATAGTAAGAACCTCAGACCATACCCTGTATATAACCATAATTTACAAGTCAAACCATCCAAAACTAAGACTCTTACCACTTTATATGATATAACAGCCACCAATGCTCCTCAATATACACAGCAAGCAACAACAAAACGACAAACTTTACTACCTTACACCAAAAGTTACACAATTAACGATAATCGTGATCCTTACACAAGGCCAGGAGTTTCTTTGCTAAGGGAGTTTCTAGAGAAGGAAAGAAATAAAACTCTGCTTGCTACAACTGAAAAAATTGCTACAATTCGAAGCGATAAACAGAAAAGCAAAATAAACCCGGAGAAAAAAGGAGAAACAGATAACAGGAGCAGTTTTGAATCTACTTCTAAAATACCATATACCAGTAAAACCGTTGGACAAACTGAAACTATTTTAAACGTGGAACATTCTACTGAGCCAACGACGGAAATATCGTATAAAGATCGCAGGGAGAGATTATTAAGAAAAATTGCGTTAGACAGAAAGGACGCAGAAACGACGCCGCCGACTGTTGTTACGGAAAAATATTATGGTAACCAATCGAATAGACCCGGGCTTGTTGTACCACCATCACTAACTCCTAAAACGCTTCATTCGCTGGCTATATATTATGCCACAGCCTTAGATAATTTTGCTACAACACCCACACCTGAAGATACCGAAACAACTACGTATTCTATGGATATGTATGAAAAAGTCACGGAAGGGTTGCCACCTTTATTTAGCAAGCAGACAATAACTAAATATGGTAATCTCTTTGGACTTGGAACAGGAAACGACGAAATGCTTGAGAATATTAAAATAGACCCAAATAGCTCGATAAATGAACTCGCAGAAGATCTGTCAGCACAAATGAGTCAAGGACCGTTAGCTTCATCTCCACAAATAAGAGAATTAGCGCAAATATTTACACACGCACTCTCCGCTTATCTACAGGATCCAGTAAAATTTAGAAAAGTTTTATCAGACATTAGACCAACCCATCCATCTTTTTCTGATATGTT
AGTTGATACAGACGCTTCATTTAATACAGAATCTACTACTACAGTCAACGAAGAAGACGACGAAATACTTGGATTTTCGGATGATCATAAAATCAGAGCAATAGAAAATTCCTTACGCAGTGGGAAATCAATAAATATCGCCACTGATTATCCAACTACTATTGAAGAAGTAACTACAACAGTTCAACCAACAACTGAAATTGAAACCACTACGACACCAAGAAGTCCCTTTAGATGTTGCGGAAGAATATCGGCTTCTTACACAACTGCTCCAACGCCCAGCAAACACTACTTTACTTCCATTCCATCTAATACCTTTGCAGCGGGGAAAATAAATTCATTAACTAATTATAATACTGAGACTCCTAAAAGTGAATATATCAATACAAAATTGTCAAACGGATATTTCATCAACTCCAATATAAAACAACTTCCCGTAACTGATTCTCAATTTTCCAACGATTACACTGAAACTACTACCTTGACAACAGAATCAGATATCGACATTTTTGATTACACTCTTTCTCCGATAACTAATTCACATCAGTTGTTTGAAAGTAAGAAAATAAAGACGACAACAAGTACACCGGAATCTACCCCTAAAAACTTCGCCGAGACTGACAGTATTGAACTAGAAAATGAAGAAGAACTCCAAAGAGCACACAGTCAGTCTTTTGTTACGCCTCAAGCAAATAGTGTCCGTAAAGGCAAGCAAATAAATCAATTTGTGAACAAAGAATTAAAGAAACCTGCTGAGGATTTAGAAGCACCGACTCAAGCATCAATAGATTTAACGACACTTGCACCAACTACTACCCAAGCCACTGCTTCAACTCTTTCAGACCAAACTTCGACTTTAACCACCGTGAATCCTTCACAAACTAGTATATTTACTTCTCCAGATAGTGAAAAAAATAATAACGATTTCCAATGGCCAACCACTTTTGGTAATTGGCAAAGCACAATCATAGATCCCATCACCCTTAACGATGGCTTAAGTTCTACTGGACCCGAGCAAGTAGTATCTGAAATATCTCAACAAACTAACGAATGGCCATTAGAGGCTGCGACGACTCAATCTACGTTCATAACAACCAATGAACCCGTCTCTACGACAACTGTTAATGTAGAAATAACAACAAACATCAAAAATTATGAAAGATTTGGTAGACTTCTTTCTGACCCTTCATCGACTGAAGCCTCTCAAGATATATCAACTGTTACAGACACTATCGTCGAAAAAGCAAAGCAAATAATGGGAGGAATGAATTCAACAACGACGCAAAAACTCATGAACGTCATGAAAAAAACGAAATCAAAGACAGTCAAACGTTTAATTCTCCTTTTAGTGCAAACGTGTGACGACGATCACAATTCGACAGCGGAAGCTTCAAAGAAAGCATTGCTAGAAGCTCTGATGGCCGTCTCGCAGAAAGATATGGACGAAATAGAAAAAGAAGAAGAATCAATAGAAACACATTCGGCAGAGTCTTTACCTGATGGGAAAACAAAGGAATTCGAACGCCGAATGGACAGGATTCAAGTGGAACCTAGACAGAATAAAAATATAAACACAGAGGCCGAGGAAGTCAACTCCTTATCAACTCCCACCACAACTGAGAGTTTTAAGACCGAAGAGACACAAACCACACCAGTCACAACAGCTAGAACTACACGAACGAGTCGAAGAGGAAGCAGAAAATATTCGTTTTCTACAGAATCAGAACAAACACATACCACGGCGGGAGACCGACCGCTCGCCGAGGCGAGGACAGCTCCGCGGCCCGAAGTCAAAACACAATCAGACACGAGGGCTTTGGAACTATTGAGATCATTGTACACCATCGCCGCGAGGTGGGGCAAATAG

Protein sequence:

>DPOGS209028-PA
MISLKTICVLVALNLVQTCHCARQLANRRKDTSSLYLPEPSAQSLTAIAQAMGAAGFEDYTEGKTLVKRLITADDQSELDVVEHMGVIGKAGVDFPALPNIPKTGFNCKNVPTGYYADLETDCQVFHICDTSRKISFLCPNGTIFSQSHLICDWWFKVDCASAPALYEASVEYYSNEQKKSQKVGRTLSKNSNNRNVGADSQVRTESRKAPIPSTTEKLLRQSQNSQTNEPVPTDVPTTRNYQTIFDINPTYATTEFENRKKNLVQLISNDFSNYPSKTTLPVYDSTTRKTTEPVYDHNSLREMQVAAETASFAQNQNRQFLQEYNSKNLRPYPVYNHNLQVKPSKTKTLTTLYDITATNAPQYTQQATTKRQTLLPYTKSYTINDNRDPYTRPGVSLLREFLEKERNKTLLATTEKIATIRSDKQKSKINPEKKGETDNRSSFESTSKIPYTSKTVGQTETILNVEHSTEPTTEISYKDRRERLLRKIALDRKDAETTPPTVVTEKYYGNQSNRPGLVVPPSLTPKTLHSLAIYYATALDNFATTPTPEDTETTTYSMDMYEKVTEGLPPLFSKQTITKYGNLFGLGTGNDEMLENIKIDPNSSINELAEDLSAQMSQGPLASSPQIRELAQIFTHALSAYLQDPVKFRKVLSDIRPTHPSFSDMLVDTDASFNTESTTTVNEEDDEILGFSDDHKIRAIENSLRSGKSINIATDYPTTIEEVTTTVQPTTEIETTTTPRSPFRCCGRISASYTTAPTPSKHYFTSIPSNTFAAGKINSLTNYNTETPKSEYINTKLSNGYFINSNIKQLPVTDSQFSNDYTETTTLTTESDIDIFDYTLSPITNSHQLFESKKIKTTTSTPESTPKNFAETDSIELENEEELQRAHSQSFVTPQANSVRKGKQINQFVNKELKKPAEDLEAPTQASIDLTTLAPTTTQATASTLSDQTSTLTTVNPSQTSIFTSPDSEKNNNDFQWPTTFGNWQSTIIDPITLNDGLSSTGPEQVVSEISQQTNEWPLEAATTQSTFITTNEPVSTTTVNVEITTNIKNYERFGRLLSDPSSTEASQDISTVTDTIVEKAKQIMGGMNSTTTQKLMNVMKKTKSKTVKRLILLLVQTCDDDHNSTAEASKKALLEALMAVSQKDMDEIEKEEESIETHSAESLPDGKTKEFERRMDRIQVEPRQNKNINTEAEEVNSLSTPTTTESFKTEETQTTPVTTARTTRTSRRGSRKYSFSTESEQTHTTAGDRPLAEARTAPRPEVKTQSDTRALELLRSLYTIAARWGK-
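
The transcript and protein lengths listed above are consistent with each other: 3867 bp is 1289 codons, that is, 1288 amino acids plus one stop codon. The following is a minimal sketch of how that relationship, and the translation itself, could be checked. It assumes the standard genetic code and writes the stop codon as "-", as the protein record above does. The function names translate and expected_protein_length are hypothetical helpers for illustration, not part of any database tool.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# (assumption: this transcript uses the standard nuclear code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are written as stop_symbol, matching the trailing "-"
    in the protein record.
    """
    cds = "".join(cds.split()).upper()  # tolerate line breaks inside the sequence
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


def expected_protein_length(cds_length):
    """Number of residues for a CDS that ends in a single stop codon."""
    return cds_length // 3 - 1


# First seven and last five codons of DPOGS209028-TA, copied from the record.
print(translate("ATGATTTCCTTAAAAACAATC"))
print(translate("AGGTGGGGCAAATAG"))
print(expected_protein_length(3867))
```

Passing the full DPOGS209028-TA sequence to translate (line breaks are ignored) should reproduce the DPOGS209028-PA sequence, including the final "-".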