Monarch geneset OGS2.0

DPOGS207511
Transcript: DPOGS207511-TA (2397 bp)
Protein: DPOGS207511-PA (798 aa)
Genomic position: DPSCF300177 - 150463-156788
RNAseq coverage: 1235x (Rank: top 10%)
Annotation
Heliconius: HMEL015566  0.0  86.69%
Bombyx: BGIBMGA001935-TA  0.0  79.25%
Drosophila: Dg-PD  4e-109  36.75%
EBI UniRef50: UniRef50_E0VQL2  6e-135  37.80%  Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VQL2_PEDHC
NCBI RefSeq: XP_002428406.1  1e-135  37.80%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|307209853  4e-139  43.85%  Dystroglycan [Harpegnathos saltator]
NCBI nr blastx: gi|307209853  6e-152  40.99%  Dystroglycan [Harpegnathos saltator]
Group
Gene Ontology: GO:0016020  3.2e-20  membrane
    GO:0005509  3.2e-20  calcium ion binding
KEGG pathway: phu:Phum_PHUM379620  3e-135
    K06265 (DAG1) maps-> Dilated cardiomyopathy
    Viral myocarditis
    Arrhythmogenic right ventricular cardiomyopathy (ARVC)
    Hypertrophic cardiomyopathy (HCM)
    ECM-receptor interaction
InterPro domain: [545-798]  IPR008465  2.1e-49  Dystroglycan
    [421-526]  IPR015919  3.2e-20  Cadherin-like
    [422-523]  IPR013783  3e-17  Immunoglobulin-like fold
    [423-527]  IPR006644  4.8e-10  Dystroglycan-type cadherin-like
Orthology group: MCL14439  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207511-TA
ATGGCTTTTGCCAATGGTGGCAGTACTAACCCGTGTACCGCGCGGCGCCAGGTGGACGCCCCAAAACCCCCGCACCCACACCGCCACCACCACGGCCAGCCCGACTTCGCGCCACACACCACCACCGACACGTTTCAGACCGAGCTCACCGACAACACGGCGATAACCGACTCCCAACCCCCGACCGAACCCCCGACGACCCCCGAACTCCAACCCCCAATCCCCGCGTCCCCGATCCCCTCCACGCCAGCCGCGACGCTTGTGGAACACACGCTCACGGAGACCCCCGAAACAGCCTCAACTAATACGTATACCACGGAGGATCACAAAGTTAAAATCACACCGTATTCTGAATCCGCGTCCGAGTCGACTCAAGTGAAGATGACGAAGGAACATTTGGCTAACATACCGAGACGACTAGACGAACATTCGCCCATAATAATTGTCCCAGAAGACATTCCGACGACAACTGAACCGGAAGAGGATACCATAACTTCAACGGAGCGTACCAGTACTATAGTTGGCAGAACAACAGAGACCGTTCCTTTCCCTATGCCGGTGAATCAGCCGCCGACACTCAAACATCACATGAAAAAACTAGCCATTACAGCCGGAAAGGCCTTCAGATACATTATACCAGCCGATCTATTCACTGATCCTGAAGAAGGCAGCAACTTAACGTTTAGGATGTACGAAGAGGAAAATGTACCACTTAACAAGAACTCCTGGATACAGTTCCTACCATCCGAACGAGAGGTCTATGGATTACCACTAGAAGCGCACGTGTCTCGCTGGAACTTCATAGTAGAAGCTCAGGACAGCGAAGGTCTCGTGGCCAGGGGACCTCTGGACATTACGGTGCAACAGCACAAGAGCGGCAGAACTATCAACCACCAGTTCATAATGAAGATGAAGTTACAGAAGAATTACAACAACGCCGTGGACTGGCAGATACGAGCCTTAGAGGGCATCGTGAACCTGTTCAGGGACACCGACATGGATCACCTCACTGTACTGAACACCACGCAGAACGGAGATCTGTATGAATTCGTGTGGACCAACGACACCCTGCCCAAGGATCCCGCCTGCCCCATGGATGATATCAACAGGCTTATGAAGATAATGGTATCTGAGCCGGAGTCAGGCAGCCCGTCCCCTAGTCTGTCTCGAGCGATGTTCCCAGACATGAAGGTGTCGGAAGTGAGGTGGAGGGGCGCAGGCCGATGTGTTCCCCCCTCCACACGCGCTCACGACACTTACCCCCCGGTCACTAGGAACCAGGTGGATCATCTCACAGCTACAGTTGGACATCTACTACTGTATAAAGTGCCAGAGGACACATTCTTCGACCCGGAGGACGGTGGTACTCGTAACCTTCAGCTGTCGCTCAGGTTCAGCGACCGCTCAGAGATCCCTTCGAACCACTGGTTGCAGTTCGATGCACGAAACCAGGAATTCTACGGCCTGCCCGGGCCTGGGGATGAGAAGATTGTACATTATCAGCTGATCGCGGAAGACTCTAGTAAGAAGAGTGCGTACGACAGTCTCATAGTGGAAGTAGCTAAGGCGCCGACCATACGACCGACCGTCGAGTTCCAGATGACAATGGACCCGTCGCCACTTGCGGATAGCGCCACCAACAAGAGGAAGGTTGTCGAGAAGTTGGCGGCGCTGTTCGGACAGACCGACACAGACAATATACGCATACAGAGCATCACAGACAACCCCACCACTATTGTATGGTACAATACCAGTCTGCCAATGGACAGATGTCCCAAACGGGAAATTGAGGAGCTACGAAGGATGATAATAGCCGACGAACGGGGATCTGTCGGCGGAAACCTCAAAGAGCACGTCGACCAGATATTCGACAAAGACTTAAAGGTTATGTCCATACGGCTCATACCATTGGGACTTTGCGCGGAACAGAATACGAAGACACCCAAAACAATGGCGCCCTCTCACGGCCCGAATCTACAGAACAAGGCTACTAACGCCTC
CCCCGAATACTCGGACTATTTGGTCACGTTCGTGGTACCAGCCGTCGTCATCGTCTGCATGATCGTGGTAGCGGCCATCATCGCTTGCGTACTGTACAAGAGACGGCGAACAGGTAAAATGAGCGTTGGCGACGAAGAAGAGCGTCAGGCGTTCAGATCTAAAGGGATCCCTGTGATCTTCCAAGACGAACTCGAAGAGAGAGTCGAAACTGAGCCGGGGGATAAGAGTCCTGTCATCATGAGAGAGGAAAAGCCACCCTTACTGCCGCCGACGCCAGATTATCGTAATGAAGACGCACCTTACCGGCCTCCGCCGCCCTTCGCAGCCTCGCGCACCCCCCCGCGACCTAAAGCGACCCCCACTTATAGAAAACCACCCCCCTACGTACCGCCCTAA

Protein sequence:

>DPOGS207511-PA
MAFANGGSTNPCTARRQVDAPKPPHPHRHHHGQPDFAPHTTTDTFQTELTDNTAITDSQPPTEPPTTPELQPPIPASPIPSTPAATLVEHTLTETPETASTNTYTTEDHKVKITPYSESASESTQVKMTKEHLANIPRRLDEHSPIIIVPEDIPTTTEPEEDTITSTERTSTIVGRTTETVPFPMPVNQPPTLKHHMKKLAITAGKAFRYIIPADLFTDPEEGSNLTFRMYEEENVPLNKNSWIQFLPSEREVYGLPLEAHVSRWNFIVEAQDSEGLVARGPLDITVQQHKSGRTINHQFIMKMKLQKNYNNAVDWQIRALEGIVNLFRDTDMDHLTVLNTTQNGDLYEFVWTNDTLPKDPACPMDDINRLMKIMVSEPESGSPSPSLSRAMFPDMKVSEVRWRGAGRCVPPSTRAHDTYPPVTRNQVDHLTATVGHLLLYKVPEDTFFDPEDGGTRNLQLSLRFSDRSEIPSNHWLQFDARNQEFYGLPGPGDEKIVHYQLIAEDSSKKSAYDSLIVEVAKAPTIRPTVEFQMTMDPSPLADSATNKRKVVEKLAALFGQTDTDNIRIQSITDNPTTIVWYNTSLPMDRCPKREIEELRRMIIADERGSVGGNLKEHVDQIFDKDLKVMSIRLIPLGLCAEQNTKTPKTMAPSHGPNLQNKATNASPEYSDYLVTFVVPAVVIVCMIVVAAIIACVLYKRRRTGKMSVGDEEERQAFRSKGIPVIFQDELEERVETEPGDKSPVIMREEKPPLLPPTPDYRNEDAPYRPPPPFAASRTPPRPKATPTYRKPPPYVPP-