Monarch geneset OGS2.0

DPOGS209302
TranscriptDPOGS209302-TA873 bp
ProteinDPOGS209302-PA290 aa
Genomic positionDPSCF300234 - 467208-483094
RNAseq coverage15x (Rank: top 82%)
Annotation
HeliconiusHMEL0042362e-10974.72% 
BombyxBGIBMGA013755-TA3e-14091.29% 
Drosophiladpr-PA2e-8960.98% 
EBI UniRef50UniRef50_G6DPQ58e-155100.00%Putative defective proboscis extension response n=9 Tax=Endopterygota RepID=G6DPQ5_DANPL
NCBI RefSeqXP_973268.28e-10269.39%PREDICTED: similar to defective proboscis extension response, putative [Tribolium castaneum]
NCBI nr blastpgi|1892393931e-10069.39%PREDICTED: similar to defective proboscis extension response, putative [Tribolium castaneum]
NCBI nr blastxgi|1892393935e-9769.39%PREDICTED: similar to defective proboscis extension response, putative [Tribolium castaneum]
Group
KEGG pathwaymdo:1000280281e-08 
 K06550 (L1CAM)maps-> Axon guidance
    Cell adhesion molecules (CAMs)
InterPro domain[121-245] IPR0137834.9e-16Immunoglobulin-like fold
[150-245] IPR0035991.1e-10Immunoglobulin subtype
[46-129] IPR0130982.6e-10Immunoglobulin I-set
[150-232] IPR0131061.8e-08Immunoglobulin V-set
Orthology groupMCL16401 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209302-TA
ATGTTCAACCTAAACGCAGACCTGTACCCACATTACAGTTCAGATGTCAGAGAGATAATTAGGAGCTTGTTTGTTACAGGGTCAGCGCATTATGGTGAGGACAGCGTGGGCTCGTTTCCATACTTCGAGTATAATGTGCCGCGAAATGTTACCACCGTTGTCGGTCAGACCGCCTTCCTGCATTGTAGGGTTGAACAACTCGGCGATAAAGCGGTGTCATGGATAAGAAAAAGGGATCTTCACATTTTAACAGCTGGAATTTTGACATACACATCAGATCAGAGGTTCCAGGTCATAAGGCCAGATAAGTCTGAGAACTGGACTCTTCAAATCAAATTTCCTCAGGAAAGAGACGCTGGCATCTACGAATGCCAAGTAAATACGGAACCGAAGATGTCTCTGGCATTTCAATTAAATGTTGTCGAGGCCAAAGCAAAAGTTTTGGGTCCAGCAGACTTATACGTCAAGACGGGCAGTTTGTTGTCATTAACTTGTATCTTGAGTCAAGGACCGCACGATTTAGGCACCATATTTTGGTATAAGGGATCAAAATTAATAGAATACAAAGAATTAGAAGCAAATGAAGTGGAAGAGCAAAGGATTAGGCTCAAAACGGAATGGACAGAACAACTGTCGTCACGGTTAACTATCGACAAGTTACAACCGACGGACAGCGGAAATTACAGCTGCGTTCCCACAATGGCTGAAACTGCCTCCGTCAATGTACACGTTATAAACGGTGAACATCCAGCTGCGATGCAACACGGAAACACAAACACAGCATCTCCTTGTGCGCTGTCGGTTAATCTTCTCATCTCACTGTATTGTTCCCTCGCTACCTTATACACAAGCACCGTTGACGGACTTAGATGA

Protein sequence:

>DPOGS209302-PA
MFNLNADLYPHYSSDVREIIRSLFVTGSAHYGEDSVGSFPYFEYNVPRNVTTVVGQTAFLHCRVEQLGDKAVSWIRKRDLHILTAGILTYTSDQRFQVIRPDKSENWTLQIKFPQERDAGIYECQVNTEPKMSLAFQLNVVEAKAKVLGPADLYVKTGSLLSLTCILSQGPHDLGTIFWYKGSKLIEYKELEANEVEEQRIRLKTEWTEQLSSRLTIDKLQPTDSGNYSCVPTMAETASVNVHVINGEHPAAMQHGNTNTASPCALSVNLLISLYCSLATLYTSTVDGLR-