Monarch geneset OGS2.0

DPOGS202409
Transcript: DPOGS202409-TA (1869 bp)
Protein: DPOGS202409-PA (622 aa)
Genomic position: DPSCF300233, 36836-42354
RNAseq coverage: 788x (Rank: top 16%)
Annotation
Heliconius       HMEL016240         1e-171   79.40%
Bombyx           BGIBMGA003304-TA   0.0      65.63%
Drosophila       dlp-PB             6e-48    47.50%
EBI UniRef50     UniRef50_B0WIX1    7e-96    39.62%   Glypican n=2 Tax=Culicinae RepID=B0WIX1_CULQU
NCBI RefSeq      XP_001656399.1     3e-98    35.64%   glypican [Aedes aegypti]
NCBI nr blastp   gi|157134701       5e-97    35.64%   glypican [Aedes aegypti]
NCBI nr blastx   gi|170041836       7e-100   37.42%   glypican [Culex quinquefasciatus]
Group
Gene Ontology    GO:0043395   2e-124   heparan sulfate proteoglycan binding
                 GO:0016020   2e-124   membrane
                 GO:0005578   2e-124   proteinaceous extracellular matrix
KEGG pathway
InterPro domain  [1-578]   IPR001863   2e-124   Glypican
Orthology group  MCL10665   Multiple-copy universal gene

Nucleotide sequence:

>DPOGS202409-TA
ATGTTCAAACGAACGTACGGCATGATCTACGAACAGCACTCGTATGTCTTCGAACAACTCTTTGAACAGTTGGAGAGGTACTACACACGAGGAGACAGCGACTTCGACGAGATGATGGACAGCTTCTTTGGGATCCTGTATCAGAAAATGTTCGCCGTTCTGAACTCACAGTATACCTTTGATGACAAGTATCTGAAATGTGTGAACGAGCACATGCGGGACATCCAGCCATTCGAGGACGTGCCATCAAAACTATCAGCGCAGCTGAGACGAGCGTTCGTCGCCACACGCACCTTCCACAAGGCGCTGCGTGCTGGAGCTGATGTTGTCAGGAATATGATGCAGGTGGGTGTAACCCAGGAGTGTGTTGCCGCATGGGCTCGTCTTCGTTACTGTGGCTCCTGCGCGGGTCACCAGGTCCCGGCCTGTAGTCGCTACTGTCACAACGTCATCCGCGGCTGTCTGCCTACACACGCAGACCTCGGAGACCAGTGGGATGCCTATGTTGATGCCGTCGAGAAGGTAGCAGATCGCCTACTCGGACCGTTCAACATAGCAATGGTCGTGGAACCGATTGACATTAAAATATCCGAAGCCATCATGAGCTTCCAGGAACGTAACCAGGAGATCTCGCAGAAAATCTTCTCTGGTTGCGGGAAACCGGTTTTGGGGGGTGGCGGCAGCACGGGGCCGTTCTTCCCCCCGGGCAGGAACAAACGTTTCGCCCGATCGATACCCGACTTCGATTGGAATCATAAACCTAATGATGTGGACGATTTTGAAATCGAGGCATCCTTTGAGAGTGTTTTCAACGACGACCCGTCGCTCATGAGCCTCCGGACGCCCGAGGGCATTCGCAAAGCGACGGAGGAAATGGCAGAGAATGCTAAGTCGAGGGAGCGTTTCCTCCAATACATGAGAGGACAGATTCAACTCGAAGAGTACGAGGAACACGAGCGGAGCAAACGTGATGCGGACCCTGAACCGGCGGCGCAAGGCGGTTCAGAAATAGACTACAAGTCGTACGAGTTTGAAGGCAAGCGAGGCTCCAAGAAGAAGAAGCTGACGGCGGCCAAGGCTGAAACCGGGCACGGTGCGGACACTGGTCCCGAGTTAGAGAAGTTGGTCCGCGAGACTCGTTCCCGTGTCCGAGCTTCCCGTCGCTACTGGCTCCACCTGCCGGCGCTTCTGTGTGCCACCGCCAGTGTTACCACCGCGCCCTGCTACAACGGCAGTCACGTGGCCAGCTACACGTCGATAGCAGCTGGTGACGGCTCCGCGGCGCTGGCCTCCAACCCGGAGGTGCGGTCCCCACCACCACCACCCCCGCCCGCGAGCGACGCGTCACCGCTGGAGGCTCTCCGCTCACTCACCGGAAGGCTCAAGGATGCTTACAACGGAGTCGAAGTGCACTGGATGGATACAGCCGAAGACCTGCAATCTGCGGCCGCGTCTCAGAGCGAGTTCATAGACAATGGCGGATCAGGGTCCGGCTCCGGAGACGACACGGACGACACAGAGGATCTGCCCGACGACGACGAGGACAGAGACCCCAGCAAGGACTATGAAGGCTCCGGCATCAGCGAGTCGCCGCTGGACCCAACCGACACGGAGACAGAGAAACAACAGCCGACCGAGACCGAGGAGCCGGTGGTGCCGAGCGTAGTGAACGTGCCGGGCACCAAGAACGTCAACCTGCCCTCCGCCATAGACGCCGGCGACGACGCCGTGGACGTCCGCGGCCGCGTGGACGAGCCGCAGCCCGCGGCGGCCGGCGCCGAGCGACCCTCGCTGCAGAACGCGCTGTTCACGTACGCGCTGCCCGTCGTCTGCGCCTGGTTCGGCTCCATCGTCACCGACCTGTTTTGA

Protein sequence:

>DPOGS202409-PA
MFKRTYGMIYEQHSYVFEQLFEQLERYYTRGDSDFDEMMDSFFGILYQKMFAVLNSQYTFDDKYLKCVNEHMRDIQPFEDVPSKLSAQLRRAFVATRTFHKALRAGADVVRNMMQVGVTQECVAAWARLRYCGSCAGHQVPACSRYCHNVIRGCLPTHADLGDQWDAYVDAVEKVADRLLGPFNIAMVVEPIDIKISEAIMSFQERNQEISQKIFSGCGKPVLGGGGSTGPFFPPGRNKRFARSIPDFDWNHKPNDVDDFEIEASFESVFNDDPSLMSLRTPEGIRKATEEMAENAKSRERFLQYMRGQIQLEEYEEHERSKRDADPEPAAQGGSEIDYKSYEFEGKRGSKKKKLTAAKAETGHGADTGPELEKLVRETRSRVRASRRYWLHLPALLCATASVTTAPCYNGSHVASYTSIAAGDGSAALASNPEVRSPPPPPPPASDASPLEALRSLTGRLKDAYNGVEVHWMDTAEDLQSAAASQSEFIDNGGSGSGSGDDTDDTEDLPDDDEDRDPSKDYEGSGISESPLDPTDTETEKQQPTETEEPVVPSVVNVPGTKNVNLPSAIDAGDDAVDVRGRVDEPQPAAAGAERPSLQNALFTYALPVVCAWFGSIVTDLF-
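
The transcript and protein lengths listed in this record (1869 bp and 622 aa) agree if the transcript is a complete coding sequence: 1869 / 3 = 623 codons, the last of which is the stop codon that the protein sequence shows as "-". The Python sketch below illustrates that check. It is not part of the database record; it assumes the standard genetic code, and the names `translate` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a code table).
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`, matching the trailing "-"
    used in the protein sequence of this record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(cds_bp):
    """Number of residues for a complete CDS, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return cds_bp // 3 - 1


# First four and last four codons of DPOGS202409-TA, copied from the record.
print(translate("ATGTTCAAACGA"))      # MFKR
print(translate("GACCTGTTTTGA"))      # DLF-
print(expected_protein_length(1869))  # 622
```

Passing the full DPOGS202409-TA sequence to `translate` and comparing the result with DPOGS202409-PA applies the same check to the whole record.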