Monarch geneset OGS2.0

DPOGS203287
TranscriptDPOGS203287-TA918 bp
ProteinDPOGS203287-PA305 aa
Genomic positionDPSCF300003 - 1628577-1631786
RNAseq coverage71x (Rank: top 66%)
Annotation
HeliconiusHMEL0063841e-15991.07% 
BombyxBGIBMGA012236-TA2e-14485.22% 
Drosophilakon-PB3e-5738.49% 
EBI UniRef50UniRef50_UPI0002247B4F4e-8550.72%UPI0002247B4F related cluster n=1 Tax=unknown RepID=UPI0002247B4F
NCBI RefSeqXP_001599803.14e-8550.91%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3287923391e-9153.20%PREDICTED: chondroitin sulfate proteoglycan 4 [Apis mellifera]
NCBI nr blastxgi|3407090292e-8952.84%PREDICTED: LOW QUALITY PROTEIN: chondroitin sulfate proteoglycan 4-like [Bombus terrestris]
Group
KEGG pathway 
InterPro domain[13-189] IPR0089855.5e-40Concanavalin A-like lectin/glucanase
[19-185] IPR0133207.1e-35Concanavalin A-like lectin/glucanase, subgroup
[36-167] IPR0017911.6e-27Laminin G domain
[44-166] IPR0126802e-26Laminin G, subdomain 2
[227-286] IPR0126793.8e-08Laminin G, subdomain 1
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203287-TA
ATGGAACTCGTGTTTGTATTGAGTTTTGTTTCTATTGGATTAGCTTACGATAAAGCGTCCTTCTACGGTGCTAGCTATATCTCATATCCACTTCAAGAAGCAAAAGGTATTACCGACATAAGTTTTAGATTTCGCACTCATCTATCTGACGCTCTTTTACTGCTGGCTGCTGGTAAAACCGATTACTGTATGATCCGTCTGGAGGGTGGCAAGTTAAAGCTGCATATCAACCTTGGGGCCGGTGAGAGTGAACTTTCTTCAGCAAAAGGTACTTACCTTAATGACACTCAATATCATCATGTAAGCATCATCAGAAGAGAAGCAAACCTTACTATGAAGGTGGACGACAGCGTTGTTAAGAAAAAATTGCCGGGGAGGTTTTTCGAGCTGAACATACACTTCGGTATCTTTTTGGGAGGACAAGGAGATTTCTCGGAACTATTTCTTGGCCACATGGAAAATTTCCGCGGCTGCATGGAAGATGTGTACTACAATGGGGTCAAGATAATAGAGAAGGCTCGCAGCCGCAGCGGTTCCGTTCACGTTGAAGGTGTAACTTGGAATTGCGCCTTGGAGTTCGACGCCGACATTAGTTCCGACATAAGTTTCATTGACGAAGGGGCGTATTTGATCCTACCTAAGATAAACTCGAGAGCCGGTGGCAGGTGGCAGATAGAGTTCAAGACGATAACACCGAACGCTGTAATACTATACAATCCCGGTGGTGGTCGCGGCTCAGATTTTCTGGCTGTGGAGATGTTGGAGGGAGTGGTCAGGGTGAAGATGGCTAAGAGTCAGATCGTCCACACGGCGAGAGTCAACGACGGGCAATGGCACAAAATGCACCTGATGTTCAATCCGTCGGTCATCGAGCCCATGCGATATGAACGTACATGTGTATCTCACCTGAGACATTGA

Protein sequence:

>DPOGS203287-PA
MELVFVLSFVSIGLAYDKASFYGASYISYPLQEAKGITDISFRFRTHLSDALLLLAAGKTDYCMIRLEGGKLKLHINLGAGESELSSAKGTYLNDTQYHHVSIIRREANLTMKVDDSVVKKKLPGRFFELNIHFGIFLGGQGDFSELFLGHMENFRGCMEDVYYNGVKIIEKARSRSGSVHVEGVTWNCALEFDADISSDISFIDEGAYLILPKINSRAGGRWQIEFKTITPNAVILYNPGGGRGSDFLAVEMLEGVVRVKMAKSQIVHTARVNDGQWHKMHLMFNPSVIEPMRYERTCVSHLRH-