Monarch geneset OGS2.0

DPOGS215421
Transcript: DPOGS215421-TA; 876 bp
Protein: DPOGS215421-PA; 291 aa
Genomic position: DPSCF300088 + 1078009-1090895
RNAseq coverage: 8x (Rank: top 86%)
Annotation
Heliconius: HMEL017455; 4e-136; 89.02%
Bombyx: BGIBMGA012382-TA; 2e-119; 84.86%
Drosophila: dpr7-PC; 6e-90; 62.02%
EBI UniRef50: UniRef50_A8DZ27; 8e-88; 62.02%; Dpr7, isoform C n=21 Tax=Pancrustacea RepID=A8DZ27_DROME
NCBI RefSeq: XP_001815475.1; 6e-103; 77.22%; PREDICTED: similar to defective proboscis extension response [Tribolium castaneum]
NCBI nr blastp: gi|189234546; 1e-101; 77.22%; PREDICTED: similar to defective proboscis extension response [Tribolium castaneum]
NCBI nr blastx: gi|189234546; 9e-100; 77.22%; PREDICTED: similar to defective proboscis extension response [Tribolium castaneum]
Group
KEGG pathway: hsa:3339; 7e-07
    K06255 (HSPG2) maps-> ECM-receptor interaction
InterPro domain: [44-139] IPR013783; 2.9e-11; Immunoglobulin-like fold
    [43-128] IPR013106; 2.3e-10; Immunoglobulin V-set
    [44-138] IPR003599; 3.8e-10; Immunoglobulin subtype
    [162-244] IPR013098; 5.2e-06; Immunoglobulin I-set
Orthology group: MCL16903; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215421-TA
ATGGGGCTCTTAGCCGTACAAAAGTTAAAGAAATTATTAATGAATTCGGTCGATGCGAAAGTTTGTGTTGCAGGTGTGAATGTAAACGTGGATCCTCGAGCTGAACGGCCTTACTTTGACGACGTGTCTCCTCGAAACGTGTCCACCGTCGTCGGCCAGTCCGCTGTGTTGAGATGTCGCGCCAAACACATAGGAAATAGAACGGTGTCGTGGATGAGGAAGCGTGATCTTCACATTCTGACTTCCCACATCTTCACGTATACCGGAGACGCCAGGTTCAGCGTGCTTCACCCTGAGCCCTCGGACGACTGGGACCTCAAGATAGAGTACGTGCAGCCGCGAGATGCTGGCGTCTATGAATGTCAAATCAATACGGAGCCGAAGATAAACATGGCAGTCGTCCTCAGTGTTGAAGATGAGTCCCTCACACCCGCGCCTCAGCCTCCGGTTCGATCCTCAGCTGCTGCAGCAACCATCTGGGGCTCTCAGGATGTGTACGTGAAGAAAGGTAGCACAATATCGCTGACATGCTCAGTGAATGTACATTCCTCGCCGCCGTCAAGCGCCTCAGTGTTATGGTATCACGGAAATGCAGTAGTGGACTTTGACTCTCCTCGCGGCGGCATCAGCTTGGAAACCGAAAAAACAGAAGGCGGCACAACGAGCAAGCTCCTAGTGACGAAAGCGGCGCTCACAGACTCAGGGAACTACACCTGTGTTCCGAATAACGCACATCCTGCCTCAGTATCTGTGCATGTGCTTAATGGAGAGCACCCGGCGGCGATGCAGACCAGTAACCGCGCCTCTTCCTTCCTCACGTCCCAGCTGAGCTGCGCCGCGCTCACATACCTGCTGTCTTCGATGGCTTGCAGATGA

Protein sequence:

>DPOGS215421-PA
MGLLAVQKLKKLLMNSVDAKVCVAGVNVNVDPRAERPYFDDVSPRNVSTVVGQSAVLRCRAKHIGNRTVSWMRKRDLHILTSHIFTYTGDARFSVLHPEPSDDWDLKIEYVQPRDAGVYECQINTEPKINMAVVLSVEDESLTPAPQPPVRSSAAAATIWGSQDVYVKKGSTISLTCSVNVHSSPPSSASVLWYHGNAVVDFDSPRGGISLETEKTEGGTTSKLLVTKAALTDSGNYTCVPNNAHPASVSVHVLNGEHPAAMQTSNRASSFLTSQLSCAALTYLLSSMACR-
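
Consistency check (illustrative sketch):

The transcript and protein lengths above agree with each other: 291 codons plus one stop codon give 3 x 292 = 876 bp. The sketch below shows one way such a record could be checked. It assumes the standard genetic code (NCBI translation table 1) and writes the stop codon as "-", matching the protein line above. The function names `translate` and `lengths_consistent` are hypothetical helpers, not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, keeping stop codons.

    Stop codons are written as `stop_symbol` ("-" here, as in the record).
    Trailing bases that do not form a full codon are ignored.
    """
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First five and last four codons of DPOGS215421-TA, copied from the record.
print(translate("ATGGGGCTCTTAGCC"))   # start of the CDS
print(translate("ATGGCTTGCAGATGA"))   # end of the CDS, including the stop codon
print(lengths_consistent(876, 291))
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence would extend the same check to the whole record.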