Monarch geneset OGS2.0

DPOGS203288
TranscriptDPOGS203288-TA6345 bp
ProteinDPOGS203288-PA2114 aa
Genomic positionDPSCF300003 - 1591023-1611133
RNAseq coverage94x (Rank: top 62%)
Annotation
HeliconiusHMEL0063840.083.69% 
BombyxBGIBMGA012237-TA0.054.83% 
Drosophilakon-PB0.035.28% 
EBI UniRef50UniRef50_E0VHI20.042.99%Chondroitin sulfate proteoglycan 4, putative (Fragment) n=1 Tax=Pediculus humanus corporis RepID=E0VHI2_PEDHC
NCBI RefSeqXP_002425576.10.042.99%Chondroitin sulfate proteoglycan 4 precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420096100.042.99%Chondroitin sulfate proteoglycan 4 precursor, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420096100.042.99%Chondroitin sulfate proteoglycan 4 precursor, putative [Pediculus humanus corporis]
Group
KEGG pathway 
InterPro domain[55-143] IPR0089852e-09Concanavalin A-like lectin/glucanase
Orthology groupMCL10986 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203288-TA
ATGAGTGATCGGGAGGATATCGTGCCTAGTGAAAAGAAGAGTTCAAGTGGAGGTAGCTCTTCGGAATGTCTTTCGGATCATGACGAGCCGCCAAGGAAACGCCAAAGGGGAGCTAGTCTGCAGGATCTGGAGGAAAGGTTTGACATATTGTCACAGCAGTTGCTTTCCGTTGACAATGTGGCGTTAAGTACTCGCATCGAAAGCGGCGGTAACAGATATCTAGACCTCTCTGATACTTTCTATCTAGGTGGTATAGAATCTGAGAAACGACAAAGAGCCTTCGCTCGAGGCGTCAAAGCAGCTGATTCAAGTATCATGGGTTGCATCAAACCAATCGAGGTGGACGACAGGTTGTATGGTCTGCCTAACGCGGTTGTCACATACGGTATTAGTCCGAAGTGTGTTTGGTGGTACCCCTGCCAGAGCGCCCATCCATGCGTGTCACAGGCCGTCTGCGAGCAACACGGTCTAGATCACTTCACTTGCAAGTGTGATAGCGATCTCTGCATAAATCCCGACTACGCTGAAAAATATAAGGTTTTTTCAAAATCGAGTAGCGAATTGGAACTGGTAACGTTATATCCTCTGACCGTCCAAGAGGGTGGTGTGGCTGTTATAACGTCACAGAACATAGATGTCGTATTGGATCACCATAAGTATGGCGTGCGTCCGTCTGGGGTACTGCTGCATGTGGCGAGGTCGCCACAACACGGCCGCATAGCCATAGACCTCTCGCTACAGAGGAATGCGCCACAGTATACAAATTATGTGGACGGTGAAAAAGCCAAACAGTTTTTCACGCTCATGGACCTTACCAGAGATAAGGTTCGCTACGTCCACGACGGCTCAGAAAATCACCAAGACGCCATAGTATTAGAGATGGAACTCATACCAGAGACGAAGTTCACACTGCCAAGCTACCTCCAAGGACGCAACACGTTTGTCCTACACGTTAACGTGACGCCGGTCAACGATCCGCCAGTACTGAATTTGTTGCCAGGGAAAATATTAAGATTGACCCAGGGAACACGAAAGGTTATAACCTCCGACATTTTGAAGGCAGAGGATCCTGATACACTGCCCGAAGATCTCTTGTATACCGTACTGCATGGAAAGAATGAAGCAAATAGCGGCCACATCGAGATGTCTGGTCAGCCTGTGGACTCTTTCACTCAGCAGGACATCGACTCCGGCATAATATCCTACGTTCACGGTACAACCAGCGACAAGCAGCTGAATAAAACATCGCTGAGACTGACATTACAGGTCTCCGATGGCATAGAAACTAGCGGGCCGGGCGTCCTCCGCATATCTATAGTACCGCTACAAGTGCGTCTCGTTAACAACACCGGACTGTTCCTAGTGCATAACTCGTACGCCATAATAACAGCCGACAACCTGACATTCGCCACTAACGCAGACGAAACCAACGTCCAAGTCAAATACGACGTAGTGAAACCTCCTCAGTTTGGCGTTGTCGAACGTCTCCGAGTACTAGATGGCACTTGGCAGACTGTTGACACGTTCACCAGTGAAATGATCAGTTCCGGTAGAGTCCGCTACATGCATATATTAGGAAACCCATCGCACGATGAATTTAAATTCAAAGCCTCCGTCGGCACAGTACGGACGAACACTCTATACGATTTTCGATTGACTTTCATCAAACTCGAACTATATCAAATGACGAACGAAGAATTGGTTTTGAACAATACCAGAGAGGCGTTCGTTTCCGATCAACACTTGCGTTTCAAAACGAAGCCGCTCGCGCTGACGGGTGACAGGATACTATTCACAATAATTAAACCGCCGAAGTATGGTATCCTACATCTGTCGTCTGGTAAACATCATTTGCAACTGCACAGCACTTTCACACAGCACGATATAGATTCGGACCAGCTGTGGTACAGATTACACAGACGCGCATACTCTCACATACAGGACGAGTTCACTTTCGTGGTAGGGGCTACGGAATGTGAGAATATCACAGGAGTAATGACAATAAGACATGTGCCGGGCACATCCAGTAGCGATCACTTGTCAGGGAGGATACACACCACGTTGGAGAGATTGCAAGTCATAGAAGGTTCCAGAATGGCGATACCAGCTACTCACCTTAATTTCAGAACGGATTCAATAACCAACCTAGTGTTCAATATAACGCGACCGCCCAAACACGGCAAAATCGAAGTTATCACCGATCATTTGAAAATACTGAGAGACAATACCACGTACTTCACTCTACAGGAATTGAATTCCGACAGAGTTTATTATACCCACGACGATTCGGAAAGCAGGCACGATTCTTTTCATTTCATGGCATTAAGTCCTGAGCCGGAAGACTTTCAGTACGTTGGAGTTTTCCATATCGACATCATACTGAAAAATGACAATAGTCCCGTGCGGGCGAATGAAAAAGTGTTTCATATAGTCCACGGAGGGGCGAGGCTTATAATGGCTAGGGATTTGAGTTACACCGACGCTGACTTGGACACTAAGCCTTCAGACATAGTGTATACCGTACAGAGATTCACGAAAGATCCTCCAAACGGCGGCATATTCCGTGCAGATAACCCATCCGAACAAATTGCTCAGTTCACGCAGGACGACATTAATAAAAACCTTGTAATGTTTAAGCATCAAGGCAAAGAGTACGGCAAAATAGCGTTTTGGATATCAGACGGGCTATTCGACGTGAACGGTAATTTGGAGATACAAGCTTCACCTCCCTTCATAAGAATGTATCCAACTAACGGTTCAATTGTAGAGAATGGTAAATCCGTTGTCTTAACTACCAAATACATGCAGGTGGACACTAACATGAATTGCCTTGAAGAAGATATCAGATACGAAATTATACAAGAACCCAAACAGGGGTCTATAGAAGTTGGTGAAATTTTGGGAGCAATTGCATTCACTCAATTGGACATAGCGGCTGGAAGAGTGGCCTATAAACACAGGGAACCGGAAACGCAAAACGATGCTTTTAGGTTTAAAGTTACGTGCCTTGAGGCCTGGGGTGAGGGTATATACCCTATTAAGATATTTCCGTCCAGTTACTGGGAACCTCTAAAATTAACGAATAATAAAGCATTAGTTGTCGAAGAATCAACTAGCCTGAATATCACGAGAGATATACTAGAAGTCATGCATCCGCAAATTGAACCTTCAAATATTCTGTACCAAGTCACCGATGGCCCGTACCACGGTTGGCTCGAAGTTACAGCAGTGGGTACGGTTGAATTGGAGAATTACAACGAGGAGCCAGTGCAAACTAAAGTGTTCGATCAATCTATCATAAACTCAAATAGATTAGTCTACGTACAGGCCGGTGTGAATCGAACCAGAGACAAAATCAAAATGGACGTAACCAATGGGATCGTTTGGCTCAGAGGAATAGAGCTTACTGTCATAATAATACCGGAGCATTTTTACGTAGTCTCCTCAAACCTGACGGTGGTGGAGGGGATGTCCGTCAGTATCAAGCAGGACTTGTTCAGTACGGTCACGGAGTACTACCGCGGGCGAGTCGTCTCCTACAAAGTTGTTCAGAATCCGAAATACGGCAAGATCGTTATGGATGAGCAGGAATTGACATTGCTGCCTGTGCTTAAGTTGAATTCCGGAAATATCGTGTACACTAATGACGGTTCTGAAGAGTCAACTGATGTGATAAAGTTGGTTGCGATAACAGAAACCGGTAAGGAAAGTGAACCGTTCTATCTCCGCATCAATATAGAGCCAGTTAACGATGAACCACCAATCGTGGCCGCTAACACCGGCCTCTGTGTGTGGGAAGGTGGCACATTTACGTTTACTAGAAATGAACTTTATGTAAACGATATTGACACGCCATTAAGAAACGTCACAATTAGGGTAGTGGATATTGTCTCTGGCTACATCGCCACACGAGGCGATCTAGACACTCCCATAGATCACTTCACACAAGCGGATATTGATAACAGATATGTCGTATTTGTTCATAAAAACGGATCCAAGGGCAAGATGATATTTAACGTAACCGATGGTCTTCACGAACTATCAAAAATAACATTCCTTATAACAACAAAATCTGTGTCCCTTAAGCTGGTCAGAAAACATTCATTGCGAGTATTCCCTCTGATGAGAGAGCCGCTCAACAATTACCTCCTGATGGCGAAATGCACAGATCCATCCAGACCGATAGTTTTTAAAATTGTAAGAGCACCAGCTTTAGGTAGGCTGGTTATGCTGAGTGGGGATAACCATCACAGATCCGTAACACAATTCACACAGAGGGATATAAATGAAACAACAGTCTATTACGAACATACTCACCCATTTTCTGATCTTTATACTAACGACTCCTTCATATTTAAAGTGGAGGCGGCATTGGCTAAGCCGGTACTAGATCAAATCTTTCATATCGATATATCAGTGGCATCCGGTGGTTTGGCAAAGTATGTGAATATTCCATTGACCAAAGTCAAGGAAGGCGACAAAATTCCATTGCGTGTGAACGTGACTAATGTTATAACATATTTGGAGACACAGGCTGGTGTTAGACAACCACAAATTGAGGCACAGTGGTCGTTACCGATGCACGGTGTCTTAAATCCTTTGTTATCTTCGCTTACACAAAGCCAGCTGGAAGATGGTGTCGTAACATATGAGCATGATGACTCGGACACGGTTGAAGACAGTATAGATATGGCGCTGTATTTGCTACCAGATTATGTTCTCTTGTGTAATGTCACTATACCGATTCATATTGTGCCAGTAAACGATCAGCCGTTCAGGTTGTTAACGGACACTCCACAAATACAGGTGGTGCAAGGAGAAAATTATACTTTAACTAAGAATGATTTGCTCACTGAAGATGGTGACACTGTGCCATCGGGTATACTCTATGATATAATAAGTGGCCCGACACAAGGTAGACTAGTCATGATGGATGAAAATCAGACACTAGACGAGGCGCAATCCATAAACAAATTCACCCAACAGGATATAAATGAGGGTAGGATTGTATATGAACATTCAGGCATATTGCAAACAGCGACATTCTATTTCCGGGTATGGGACGGAGAGTTCAAACCGACCTATACGGTTTTCACTATAGACGTCATACCAGTTATATTGAACGCGTCATCACTACATCCGATATTCTTGCAGCAAGGTTCAAACGTGGCGACTGTGGCACCAGATCAAATATATGTAGAGACCAATGCCAAAAAAGATAAAGTCTGGTACAATATAACAAGACAACCAGTTCACGGGATGATATACTTGGGAAGGAATCCTGTAACTTATTTTTCACATAAGGATATAATGGATAAAGTAGTCATTTATATGCAGAATGATATGACGGTAGCGAATGATAGTTTTGATCTGATCGCCTATGTCCATAACAGCAACGCCACACAGCCTTTCACCATAGACGTTGTTGTGCAACCGTTACTAGTATTGGGGGATTTGAAAATTATTGAACAGAAAACAAAAATAACATTAAATAATTTGGATGCGAATGAGTTGGCAAAACTAACAGCGAGCGATCCGGTTTACACGATATTGAGGAAGCCAAATTATGGAAGCATAAAAAAGATAATAAGAAGCTCTGGCGAGAAAACCAGTGCGAGGGAGAGGGAGATAGCGTATTTCACTCACGAGGACATTAAAGCTGGTGTCATATACTATGTGGCCAGGAAGAAATTGGCTGCTCTGAACGGTGTCCAAGACAGTCTTGGTTTCCTACTCGCTGCGACAATATTCCAACCAGCCACCGGTGAGCTTGATATTTATATCGGCAAAAAGGGTGACAAGAAAAGTTTACTGGGACCAAGCGATCCTGAAGGCCACGAAGGGATTCCGGTAAAAAATGGACAAACATCGTCATATTATATGATGGTGATAATGACGTTGCTTGGTGTACTCCTGGCTGTTATAATACTAGTCAGTCTATTGAAATGTCGTCGTTATATGACAAGGGATCAGAACGCCATGGTGAAAATACACGGACAGAGTCAGGGCGCGGTTGCTCCTATACCGTTACCTCGACCACCAGACCACTTGATGCCGTCACCAACCCAAGCCAGTCCTCCAATAAAGAGATATGTGTCTTCGGAGCAATCGGTACACACTGGCACCAGCACTCCTCTACCGTCAGGTGGTAGTGTAGCTTGTAAGGTGACCCCGTTAGCGGACGCTGGCCTTCCAGACCTCAACGCAAGGTATCCTTATGGAGCCGATGACCATACCGATGCGGAAGATTGGAGCAGCTATGAGGCTAGCGAGTCAGCCTTCCCGGTCCGCTCAGGCGGTGTCCCCACCAACCCGATGCTGCGCCGCAACCAGTACTGGGTCTGA

Protein sequence:

>DPOGS203288-PA
MSDREDIVPSEKKSSSGGSSSECLSDHDEPPRKRQRGASLQDLEERFDILSQQLLSVDNVALSTRIESGGNRYLDLSDTFYLGGIESEKRQRAFARGVKAADSSIMGCIKPIEVDDRLYGLPNAVVTYGISPKCVWWYPCQSAHPCVSQAVCEQHGLDHFTCKCDSDLCINPDYAEKYKVFSKSSSELELVTLYPLTVQEGGVAVITSQNIDVVLDHHKYGVRPSGVLLHVARSPQHGRIAIDLSLQRNAPQYTNYVDGEKAKQFFTLMDLTRDKVRYVHDGSENHQDAIVLEMELIPETKFTLPSYLQGRNTFVLHVNVTPVNDPPVLNLLPGKILRLTQGTRKVITSDILKAEDPDTLPEDLLYTVLHGKNEANSGHIEMSGQPVDSFTQQDIDSGIISYVHGTTSDKQLNKTSLRLTLQVSDGIETSGPGVLRISIVPLQVRLVNNTGLFLVHNSYAIITADNLTFATNADETNVQVKYDVVKPPQFGVVERLRVLDGTWQTVDTFTSEMISSGRVRYMHILGNPSHDEFKFKASVGTVRTNTLYDFRLTFIKLELYQMTNEELVLNNTREAFVSDQHLRFKTKPLALTGDRILFTIIKPPKYGILHLSSGKHHLQLHSTFTQHDIDSDQLWYRLHRRAYSHIQDEFTFVVGATECENITGVMTIRHVPGTSSSDHLSGRIHTTLERLQVIEGSRMAIPATHLNFRTDSITNLVFNITRPPKHGKIEVITDHLKILRDNTTYFTLQELNSDRVYYTHDDSESRHDSFHFMALSPEPEDFQYVGVFHIDIILKNDNSPVRANEKVFHIVHGGARLIMARDLSYTDADLDTKPSDIVYTVQRFTKDPPNGGIFRADNPSEQIAQFTQDDINKNLVMFKHQGKEYGKIAFWISDGLFDVNGNLEIQASPPFIRMYPTNGSIVENGKSVVLTTKYMQVDTNMNCLEEDIRYEIIQEPKQGSIEVGEILGAIAFTQLDIAAGRVAYKHREPETQNDAFRFKVTCLEAWGEGIYPIKIFPSSYWEPLKLTNNKALVVEESTSLNITRDILEVMHPQIEPSNILYQVTDGPYHGWLEVTAVGTVELENYNEEPVQTKVFDQSIINSNRLVYVQAGVNRTRDKIKMDVTNGIVWLRGIELTVIIIPEHFYVVSSNLTVVEGMSVSIKQDLFSTVTEYYRGRVVSYKVVQNPKYGKIVMDEQELTLLPVLKLNSGNIVYTNDGSEESTDVIKLVAITETGKESEPFYLRINIEPVNDEPPIVAANTGLCVWEGGTFTFTRNELYVNDIDTPLRNVTIRVVDIVSGYIATRGDLDTPIDHFTQADIDNRYVVFVHKNGSKGKMIFNVTDGLHELSKITFLITTKSVSLKLVRKHSLRVFPLMREPLNNYLLMAKCTDPSRPIVFKIVRAPALGRLVMLSGDNHHRSVTQFTQRDINETTVYYEHTHPFSDLYTNDSFIFKVEAALAKPVLDQIFHIDISVASGGLAKYVNIPLTKVKEGDKIPLRVNVTNVITYLETQAGVRQPQIEAQWSLPMHGVLNPLLSSLTQSQLEDGVVTYEHDDSDTVEDSIDMALYLLPDYVLLCNVTIPIHIVPVNDQPFRLLTDTPQIQVVQGENYTLTKNDLLTEDGDTVPSGILYDIISGPTQGRLVMMDENQTLDEAQSINKFTQQDINEGRIVYEHSGILQTATFYFRVWDGEFKPTYTVFTIDVIPVILNASSLHPIFLQQGSNVATVAPDQIYVETNAKKDKVWYNITRQPVHGMIYLGRNPVTYFSHKDIMDKVVIYMQNDMTVANDSFDLIAYVHNSNATQPFTIDVVVQPLLVLGDLKIIEQKTKITLNNLDANELAKLTASDPVYTILRKPNYGSIKKIIRSSGEKTSAREREIAYFTHEDIKAGVIYYVARKKLAALNGVQDSLGFLLAATIFQPATGELDIYIGKKGDKKSLLGPSDPEGHEGIPVKNGQTSSYYMMVIMTLLGVLLAVIILVSLLKCRRYMTRDQNAMVKIHGQSQGAVAPIPLPRPPDHLMPSPTQASPPIKRYVSSEQSVHTGTSTPLPSGGSVACKVTPLADAGLPDLNARYPYGADDHTDAEDWSSYEASESAFPVRSGGVPTNPMLRRNQYWV-