Monarch geneset OGS2.0

DPOGS204712
TranscriptDPOGS204712-TA1683 bp
ProteinDPOGS204712-PA560 aa
Genomic positionDPSCF300257 - 157936-161726
RNAseq coverage2655x (Rank: top 5%)
Annotation
HeliconiusHMEL0117100.072.74% 
BombyxBGIBMGA008249-TA0.067.89% 
Drosophilasmg-PD5e-3333.44% 
EBI UniRef50UniRef50_E2C1I01e-4330.93%Sterile alpha motif domain-containing protein 4B n=8 Tax=Formicidae RepID=E2C1I0_HARSA
NCBI RefSeqXP_001602138.13e-5132.55%PREDICTED: similar to MGC85099 protein [Nasonia vitripennis]
NCBI nr blastpgi|3407200364e-5131.99%PREDICTED: protein Smaug homolog 2-like [Bombus terrestris]
NCBI nr blastxgi|3407200364e-5131.84%PREDICTED: protein Smaug homolog 2-like [Bombus terrestris]
Group
Gene OntologyGO:00055152.6e-15protein binding
KEGG pathway 
InterPro domain[324-382] IPR0137614.5e-23Sterile alpha motif-type
[323-384] IPR0109932.6e-15Sterile alpha motif homology
[327-382] IPR0211291.4e-12Sterile alpha motif, type 1
[322-385] IPR0016604.4e-08Sterile alpha motif domain
Orthology groupMCL12745 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204712-TA
ATGAACGGGACGTTTTACGAGCAGCTCGGGGGTGTGGCCAAGATGTTCGAGCAGTGGGGCACCTGCGAGCGGACGGTGGTGGCCTGCGCCCTCGCTCGAAGAGTGCCGTGGCCAGGGTTGCGGTTGGTTCAGAGAGCGGTGGAGGCAGCCCTGCAGAACCATGTAGGGGATGAGCGACTCGAGAGAGATGCGAACGACGAAACGTTACTCGCCAGCCTCCTCGCGTCCAGAACTGATGAACACGATGAAGAAGATGGCGAACGTCTCAGACAGCTCCTTGCGCTGCTGCCTCTCCTCCGTACTGACAATGAGCGTGCTAAGAATGTGTACGTGTCGGCGACCCCCGGCCTGGTCCAGCGCTGTGTGGACTCATCCAGGTCACCTGCAACCCCACACCTGTGTAGACAGCTGCTGTCCTACCAACTGGTACATCCAGCCTTCACTGTACATGACCAGAGAACACTGACGCAATGGCTTCGCTATTTAGAAAATCACATATCTGGGAACAGAAACAGTGAGACGCCGTGGCAGCAGCGGATAGAACCGGCCCTGTTGCAGGACACGAACATCTGGTCGGCCAACAACACCTTCCGCCGCACCATCGGCAAGAACGTGGACTTCCGGGGGATGTTGGACTCGTTGGAGCACGCGGCCTACACGGACGTGTTGCAGGAGTCCTTCTCCAAGAACGGCAGGGACGTGGATATCGGCCTGGACGGAGACGCCGCCCACTACGAGGCGCAGACCAAGTCACACCGGTCTAATAGTCTCACGCCGCCCTCCACCAACTTCATGCAGATGTCCTCCTCGGCCGAGAACCTCAGCGACGAGCCGTTCGTCCAGAAACCGAGGAGCTTCTCGCTATCCAGCGAGCACAGTCTGACTCAGCTGCGGCCCATAGGGGTGACGTATGGAACCACCGGCAGCGAGACAAGGCTGGATGACCTCCGGACCAACAATTTTGCGGAACATCCCGGCATGTCCACCGTGGGGCAGTGGCTCAAGAGTCTCCGGCTGCACAAGTACGTGTGGCTCTTCACCAACATCACCTACGAGCAGATGATGGCCATGGACGACAAGTACCTGGAGAAACTGGGTGTGACGAAGGGCGCGCGTCACAAGATCCTGCTGTCGATCGCTCGTCTGTCGGAGCGGCCGTCTATCTTGGAGTCGGTTCGGAGTGAGCTGTCTTCAGGCCGGGTGTGCAGAGCTCTGGACCGTCTCCGGAGCGTGCTGCTCTCGCCTATGCCGCCCGGGGACCTGCCCCGGGCCGTGGTCGCAGCCCTACAACACGCGTCCGAGTGTTTGTCGGGAGGCGCGGGATCAGTGGTAGCGGAGAACGAGCCGGAGGCCGTTGACCCCATGTCGCTGCATTGCTGGCTCATAGAGAAGGCTCTCCACCACGAGTCGTTCTCGTGTCCGTCTCTCCAGTCATCTCTCCGCTGTCTCCGCCACCGCGTTCCACCTCGACAGTTCTTCCACCTTGTGGGAGACGCGCCTCACAGGAGACTCAAGCCCCGTTGGCGCGCCCCGGCCGCCGCCCGCCGCCGATGGGCGCCGCCCGCCCGCGGCAAGTCCAACTCGTACCCGCCGTTCCCGCCGCAGGTCGCACCGCCGCCGCCCCACGACTACTCCAGCCTGGACGCGCTCTGCCTGCAGATGACGGAGACGGCCATCGACTAG

Protein sequence:

>DPOGS204712-PA
MNGTFYEQLGGVAKMFEQWGTCERTVVACALARRVPWPGLRLVQRAVEAALQNHVGDERLERDANDETLLASLLASRTDEHDEEDGERLRQLLALLPLLRTDNERAKNVYVSATPGLVQRCVDSSRSPATPHLCRQLLSYQLVHPAFTVHDQRTLTQWLRYLENHISGNRNSETPWQQRIEPALLQDTNIWSANNTFRRTIGKNVDFRGMLDSLEHAAYTDVLQESFSKNGRDVDIGLDGDAAHYEAQTKSHRSNSLTPPSTNFMQMSSSAENLSDEPFVQKPRSFSLSSEHSLTQLRPIGVTYGTTGSETRLDDLRTNNFAEHPGMSTVGQWLKSLRLHKYVWLFTNITYEQMMAMDDKYLEKLGVTKGARHKILLSIARLSERPSILESVRSELSSGRVCRALDRLRSVLLSPMPPGDLPRAVVAALQHASECLSGGAGSVVAENEPEAVDPMSLHCWLIEKALHHESFSCPSLQSSLRCLRHRVPPRQFFHLVGDAPHRRLKPRWRAPAAARRRWAPPARGKSNSYPPFPPQVAPPPPHDYSSLDALCLQMTETAID-