Monarch geneset OGS2.0

DPOGS214971
Transcript: DPOGS214971-TA, 1449 bp
Protein: DPOGS214971-PA, 482 aa
Genomic position: DPSCF300616 + 4183-16881
RNAseq coverage: 245x (Rank: top 42%)
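The genomic position and transcript length above can be compared with a short calculation. This is a minimal sketch, not part of the record: the helper name `parse_position` is hypothetical, and it assumes the position reads as scaffold, strand, then 1-based inclusive start-end coordinates.

```python
import re

def parse_position(text):
    """Split 'SCAFFOLD STRAND START-END' into its four parts (assumed layout)."""
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if m is None:
        raise ValueError(f"unrecognized position string: {text!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

scaffold, strand, start, end = parse_position("DPSCF300616 + 4183-16881")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
transcript_len = 1449  # transcript length from the record

print(scaffold, strand, span, span - transcript_len)
```

Under that assumption the gene spans 12699 bp of the scaffold, 11250 bp more than the 1449 bp transcript. The difference would be intronic sequence if the transcript listed here is the spliced coding sequence.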
Annotation
Heliconius      HMEL017854         0.0     81.89%
Bombyx          BGIBMGA009255-TA   2e-123  76.49%
Drosophila      Stim-PA            1e-159  61.40%
EBI UniRef50    UniRef50_F4WE24    1e-170  67.71%  Stromal interaction molecule-like protein n=12 Tax=Arthropoda RepID=F4WE24_ACREC
NCBI RefSeq     NP_001128674.1     0.0     83.44%  stromal interaction molecule 1 [Bombyx mori]
NCBI nr blastp  gi|206725501       0.0     83.44%  stromal interaction molecule 1 precursor [Bombyx mori]
NCBI nr blastx  gi|206725501       0.0     84.14%  stromal interaction molecule 1 precursor [Bombyx mori]
Group
Gene Ontology    GO:0005515  4.8e-16  protein binding
KEGG pathway
InterPro domain  [96-181]   IPR010993  4.8e-16  Sterile alpha motif homology
                 [104-172]  IPR011510  1.8e-10  Sterile alpha motif, type 2
                 [96-152]   IPR013761  8.2e-10  Sterile alpha motif-type
Orthology group  MCL11866  Single-copy universal gene

Nucleotide sequence:

>DPOGS214971-TA
ATGTTTAAGACGACATGTTATAATTTTTCTATATATCAACCTTTGTGTACTAAGGACGTACTTCTGGAGGCCTGTAATAACGAGCCGGCGTGTCTCCAAGACCACGCTGGTCTGGAGGCCATCACACAGCTCCACCGACAGCTGGATGACGACGCGAATGGAAATGTTGATCTCAGTGAGAGTGATGACTTCTTACGCGAGGAGCTTCAATACGACAGCGGCTATGAGAAGCGCCAGCGAGCCTTCCATCACAACGACGACATGCACATATCTGTCAAAGAACTTTGGGAGGCCTGGCTACGGTCAGAGGTCCACAATTGGACGGTGGAGCAGACTGTGGTGTGGCTGTCCGAGTCTGTGGAGTTGCCGCAGTACAGGACGCTGTTCTTGCAACATAGAATCACTGGTGCAGCATTACCGAGATTAGCTGTGAACAACATGCAATACATGAGCAACGTTCTGGGCATAAAGGACCCTATACACAAACAGAAGCTAGCCCTCAAAGCCATGGATGTTGTATTGTTTGGACCACCCAAAGAAGGCAGTCGTTGGAAGGACTGGTTGTTAGCATCGCTTCTGTTGGGTGCTGTTGTTGGAGGCTGGGCAGCGCTGAGAGCCGGCCGTGCCAGTAGGCATCAGGTACAGAGGATGTTGAGGGACATGGAACATCTGAGGAAAGCTGAAATGGCGCTGGACGATATGCAGAAAGAATTGGAGAAGGCGCGTCTGGAGCAGGAGAGCGTCACGACGGAAAAAAAGAATCTAGAGAAGAAGTTGCAGGAAGCTGGAGACACGCCGATGTTGAACTCAGCGTCATCAGACCTAGAAGTTACTCAACTCAAGTCCGAAATAGAGATGCTACGTGCTGAGCTGCGTCGAGCTGAAGGTGAGCTGGAGGACCGTTGCTGGGCGCCACCGGCCGGGCTCCAGCAGTGGCTGCAGCTCACACACGAGATAGAGAACAAGGCCTACCTCAGGAAGAAACAGACGGCTGACGGGCAGCTGCAGCAGGCCAGGGACGCGGTGAGTGACAGGGACAAACAGGGTGACGGACGAAAGCGAGAAAGAGACAGAGCGATACCGTCAGACGGAGAGGTGTGTGAAAAGCTACGGAAGAAACGTTCCAGTCTGGTGGGAGCCTTCGTGTCGACACACGGCAAGTCCATAGACGACGTGGACCGAGCCATAGTAGAGGCTAGGACCGCTCTCAACGAAGTTACACAGGAGCTACAGGAACGCATGCACCGATGGAAGCAGATCGAAAGGTTGTGTGGCTTCAATATAATAAATAACAACGGATTACAGTTCCTAGAGAGCACCCTGTATAGGACAGCTAACGGGAGACAGGGGAAAGTCCGAGTGAGCAGTTCGCAGGATGATCTCAGCCTTGGAGACGACACCTCACTGTGCGGATCAGGTAACGAAGTGTTCCACGACCATAGCGACTGA

Protein sequence:

>DPOGS214971-PA
MFKTTCYNFSIYQPLCTKDVLLEACNNEPACLQDHAGLEAITQLHRQLDDDANGNVDLSESDDFLREELQYDSGYEKRQRAFHHNDDMHISVKELWEAWLRSEVHNWTVEQTVVWLSESVELPQYRTLFLQHRITGAALPRLAVNNMQYMSNVLGIKDPIHKQKLALKAMDVVLFGPPKEGSRWKDWLLASLLLGAVVGGWAALRAGRASRHQVQRMLRDMEHLRKAEMALDDMQKELEKARLEQESVTTEKKNLEKKLQEAGDTPMLNSASSDLEVTQLKSEIEMLRAELRRAEGELEDRCWAPPAGLQQWLQLTHEIENKAYLRKKQTADGQLQQARDAVSDRDKQGDGRKRERDRAIPSDGEVCEKLRKKRSSLVGAFVSTHGKSIDDVDRAIVEARTALNEVTQELQERMHRWKQIERLCGFNIINNNGLQFLESTLYRTANGRQGKVRVSSSQDDLSLGDDTSLCGSGNEVFHDHSD-
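
The two sequences above can be checked against each other by translating the nucleotide record. This is a minimal sketch, assuming the standard genetic code applies and that the transcript is a complete coding sequence in frame from its first base; the function name `translate` is illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons and last five codons of DPOGS214971-TA.
print(translate("ATGTTTAAGACGACATGTTAT"))  # MFKTTCY
print(translate("GACCATAGCGACTGA"))        # DHSD*

# Length check: 482 residues plus one stop codon account for 1449 bp.
print(3 * (482 + 1) == 1449)               # True
```

The translated ends match the start (MFKTTCY) and end (DHSD) of DPOGS214971-PA. The protein record writes the terminal stop codon as "-", where this sketch writes "*".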