Monarch geneset OGS2.0

DPOGS207510
Transcript: DPOGS207510-TA (1017 bp)
Protein: DPOGS207510-PA (338 aa)
Genomic position: DPSCF300177 - 158357-161477
RNAseq coverage: 961x (Rank: top 13%)
Annotation
Source            Hit                      E-value   Identity   Description
Heliconius        HMEL009975               9e-165    82.67%
Bombyx            BGIBMGA001934-TA         3e-124    75.52%
Drosophila        -                        -         -
EBI UniRef50      UniRef50_UPI000224795E   9e-37     32.19%     UPI000224795E related cluster n=1 Tax=unknown RepID=UPI000224795E
NCBI RefSeq       XP_974516.1              6e-32     34.53%     PREDICTED: similar to AGAP005162-PA [Tribolium castaneum]
NCBI nr blastp    gi|383849507             5e-38     35.22%     PREDICTED: uncharacterized protein LOC100881230 [Megachile rotundata]
NCBI nr blastx    gi|383849507             6e-48     36.66%     PREDICTED: uncharacterized protein LOC100881230 [Megachile rotundata]
Group: -
Gene Ontology
  GO:0016020   4.5e-21   membrane
  GO:0005509   4.5e-21   calcium ion binding
KEGG pathway: tca:663372 (E-value 2e-31)
  K06265 (DAG1) maps to:
    Dilated cardiomyopathy
    Viral myocarditis
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
    ECM-receptor interaction
InterPro domain
  [49-147]   IPR015919   4.5e-21   Cadherin-like
  [55-145]   IPR013783   5.9e-18   Immunoglobulin-like fold
Orthology group: MCL26662 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS207510-TA
ATGGAGGGGTCGCGCTATATTGCTTGCGCGCTGCTCCTATTGAGTCCTCTAGCACTCAGCCGACACGATGATGACTTCGCATTCGACAACAACGAAGAATTTCAGGTGGAATTGACAGCAAATCATTCTGAGATAGCGAAAAATGGCCTAAGAAGGTTATGGGGCGTCCCCGACACATCGGCCTACGTGGGACATCTATTCAGGATGGAAATCCCAAAGGAGGCCTTCACCGGGGACGTTTTAGCATATAAGGTACGAAGGGAAGACGGTCGTCACTTACCAGGTTGGCTGGCTGTGGATACTAAACACGGTCTGATCAGTGGCGTGCCACAGAAACAAGATCTCGGAGCACATAGTTTCACAGTTATAGCACAGGGAAGAACACACGGTCTCACAGCTACAGACACGTTCACTGTTGAGGTAAAGAAAGCGGAGGAGAAGCCCCAATCGAAGTACGGCACGTGTGTCAGAAATGAAAATAGACTGGTATTGGTGATCCTGATAGACGGAGCCTTCCACAGGATCGCGCCGCGGCAAAGGATACGAGCTCTCATGGAACTAGCCAGTTTTATGGCTTTGGATGGTGATGAATTCTGGATGGAGCCTTATAAGGCTGAGTCTGCACAGTCTCATGTAGTTCTGATGAGCGGGCCCGGGACCACGCAACGGAGGAAGAGTGACGCTACTACGGCTATTTATTTGAATGTTGGTTGTGGCGAGAGGTTGTGGTCGCGTCATAAGGTCTTGGTTTCCGGTCTGAGGGAACAATCACGGGATGGGACCCTGCATCAGATATTACGTCTCCCGGTGTTAGGGTGGAGGCTGATTGCTGTTAAGACGCTACCAAGGCTGAAAAGACAGTCTGACCTGGACGACGGGTCGGGCGCCTACGACACCTACGACGACGATGAAGGCGACTACAGCGGCTACGACGACGACGACGACTACGCCGCCGGACTCGACGTAGGCGCCGTTCCCGACATCAGTTTTCATGACAAGGTTAATGTTGACCGATAA

Protein sequence:

>DPOGS207510-PA
MEGSRYIACALLLLSPLALSRHDDDFAFDNNEEFQVELTANHSEIAKNGLRRLWGVPDTSAYVGHLFRMEIPKEAFTGDVLAYKVRREDGRHLPGWLAVDTKHGLISGVPQKQDLGAHSFTVIAQGRTHGLTATDTFTVEVKKAEEKPQSKYGTCVRNENRLVLVILIDGAFHRIAPRQRIRALMELASFMALDGDEFWMEPYKAESAQSHVVLMSGPGTTQRRKSDATTAIYLNVGCGERLWSRHKVLVSGLREQSRDGTLHQILRLPVLGWRLIAVKTLPRLKRQSDLDDGSGAYDTYDDDEGDYSGYDDDDDYAAGLDVGAVPDISFHDKVNVDR-
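
The transcript and protein lengths listed above are consistent with each other: 1017 bp is 339 codons, which is 338 amino acids plus one stop codon (shown as the trailing "-" in the protein record). The sketch below is one way to check this kind of correspondence. It assumes the standard genetic code (NCBI translation table 1) and a coding sequence that starts in frame; the `translate` function and its `stop_symbol` parameter are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check using the values from the record: 1017 bp, 338 aa.
transcript_bp = 1017
protein_aa = 338
print(transcript_bp // 3 - 1 == protein_aa)  # True

# First four codons and last six codons of DPOGS207510-TA.
print(translate("ATGGAGGGGTCG"))        # MEGS
print(translate("GTTAATGTTGACCGATAA"))  # VNVDR-
```

Passing the full DPOGS207510-TA sequence to `translate` and comparing the result with DPOGS207510-PA would extend the same check to the whole record.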