Monarch geneset OGS2.0

DPOGS209800
TranscriptDPOGS209800-TA1659 bp
ProteinDPOGS209800-PA552 aa
Genomic positionDPSCF300117 - 575755-584402
RNAseq coverage427x (Rank: top 29%)
Annotation
HeliconiusHMEL0089870.067.83% 
BombyxBGIBMGA008025-TA0.082.92% 
Drosophilaash2-PC0.066.36% 
EBI UniRef50UniRef50_Q9VC550.066.36%Absent, small, or homeotic discs 2, isoform D n=39 Tax=Coelomata RepID=Q9VC55_DROME
NCBI RefSeqXP_968500.20.072.66%PREDICTED: similar to trithorax protein ash2 [Tribolium castaneum]
NCBI nr blastpgi|2700009370.072.66%hypothetical protein TcasGA2_TC011210 [Tribolium castaneum]
NCBI nr blastxgi|2700009370.072.66%hypothetical protein TcasGA2_TC011210 [Tribolium castaneum]
Group
Gene OntologyGO:00055158.9e-09protein binding
GO:00082701.4e-06zinc ion binding
KEGG pathway 
InterPro domain[263-510] IPR0089855.7e-42Concanavalin A-like lectin/glucanase
[343-506] IPR0183553.6e-28SPla/RYanodine receptor subgroup
[20-101] IPR0110114.6e-11Zinc finger, FYVE/PHD-type
[344-415] IPR0038778.9e-09SPla/RYanodine receptor SPRY
[20-84] IPR0130831.6e-07Zinc finger, RING/FYVE/PHD-type
[33-83] IPR0019651.4e-06Zinc finger, PHD-type
Orthology groupMCL13292 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209800-TA
ATGGACCCCGCTTCCAATGGTCAAAACATGCAATCCGAATCTCAGAAAGCGGGGGAAAATGAAAAAAATAAAAGTAAACCCGGAGACACGCAGGGAAACTGTTATTGTGGCAAAGAAAGGAATCTCAACATAGTGGAACTGTTGTGTGCATCATGCAACAGGTGGTATCATGAGTCTTGTATAGGATATCAGTTGGGAAAACTGGTACCATTTATGACGAATTACCTCTTCATATGTAAGAACTGTTCACCAACTGGTTTGGAAACATTTAAAAAGAATCAAGCTCCTTTCCCTCAAATGTGCCTGACTGCAATTGCAAACCTCAAGCAAGAGAGTGCCAAAGATGGCACTAATAGGATCTTATTCAGTAAAGATAGAGAGATAATTCCCTATATAGATCAATATTGGGAAGCCATGACAACTATGCCAAGGAGGGTGACCCAATCTTGGTATGCAACAGTGCAAAGAGCTCTTATTAAAGATATTCAAGTGCTGTTTATTTATGAAGAGGATCAGTCCCAAGGGCCAATGTTCGGTTTATTCAATATGGAATTGACTAATATCAAGCCCAATTATGAAGCCATGATCAAGCAAGGGCAACTCAAGGTCACCGACATGGGAATTGCAACAGTTCAACTAGCCGGTAACGTAAAGGGTCGCCAAGGTAAGCGTCGTCCGGTAGGAGTGGAAACAAGCGCCCCCGTCGGCAAGAAAGGTCGATCAGCTGATTTGGGCGCTCTAAAACTACCCTCCCACGGCTATCCGACCGAACATCCGTTCAATAAAGATGGATACCGCTACATACTAGCTGAACCCGATCCACACGCCCCGTTCAGACAGGAGTTTGATGAGAGCAACGAATGGTCCGGGAAGCCGATCCCGGGCTGGTTGTACCGGTCGCTGTGTCCCGGGATAGTGTTGCTGGCGTTACACGACCGAGCGCCGCAGCTGAAGATAGCCGAGGATCGGCTGGCTGTGACCGGCGAGAAGGGATACTGTATGGTGCGAGCTACGCACGGCGTTTCCCGCGGCTCGTGGTACTGGGAGGCGACTGTGGAGGAGATGCCCGAGGGCGCCGCGGCCAGGCTCGGCTGGGGGCGGCGCTACGCAAACCTACAAGCACCTCTCGGATACGACAAGTTCGGATACTCGTGGCGCAGCAGAAAAGGGACAAGATTCCACGAGTCCCGCGGTCGGCACTACAGCGCTGGCTACGGCGAGGGCGACACTCTCGGCTTCCTCGTCGTGTTACCTGATAACGGCGCCGCTAAATACACGCCAAGCACGTACAAAGACAGGCCTTTAGTTAAATTCAAAAGTCATCTGTACTATGAAGATAAGGACAATATCCAAGAATCTCTGAACAACCTCCGAGTGCTGTCCGGCAGTAAGATATACTATTTCAAGAACGGAGAATGTCAAGGCGAGGCGTTTGTGGATATTTACCAAGGATGCTATTACCCGACTGTGTCTTTACATAAGAACATCACAGTTAGTGTGAACTTTGGACCAAATTTCAAATATCCACCTAACATTGAACACAGCTTTAGACCGATGTCTGAGAAGGCTGAGGAGGCAATATGCGAGCAGACAATGGCGGATTTATTATTCCTTACCGAAAATGAGGGTAAATTACGTTTGGATGCCTTCAACCTCTGA

Protein sequence:

>DPOGS209800-PA
MDPASNGQNMQSESQKAGENEKNKSKPGDTQGNCYCGKERNLNIVELLCASCNRWYHESCIGYQLGKLVPFMTNYLFICKNCSPTGLETFKKNQAPFPQMCLTAIANLKQESAKDGTNRILFSKDREIIPYIDQYWEAMTTMPRRVTQSWYATVQRALIKDIQVLFIYEEDQSQGPMFGLFNMELTNIKPNYEAMIKQGQLKVTDMGIATVQLAGNVKGRQGKRRPVGVETSAPVGKKGRSADLGALKLPSHGYPTEHPFNKDGYRYILAEPDPHAPFRQEFDESNEWSGKPIPGWLYRSLCPGIVLLALHDRAPQLKIAEDRLAVTGEKGYCMVRATHGVSRGSWYWEATVEEMPEGAAARLGWGRRYANLQAPLGYDKFGYSWRSRKGTRFHESRGRHYSAGYGEGDTLGFLVVLPDNGAAKYTPSTYKDRPLVKFKSHLYYEDKDNIQESLNNLRVLSGSKIYYFKNGECQGEAFVDIYQGCYYPTVSLHKNITVSVNFGPNFKYPPNIEHSFRPMSEKAEEAICEQTMADLLFLTENEGKLRLDAFNL-