Monarch geneset OGS2.0

DPOGS204962
TranscriptDPOGS204962-TA2283 bp
ProteinDPOGS204962-PA760 aa
Genomic positionDPSCF300160 + 579372-588902
RNAseq coverage206x (Rank: top 46%)
Annotation
HeliconiusHMEL0037400.083.50% 
BombyxBGIBMGA011133-TA0.084.02% 
Drosophilasu(f)-PB0.059.42% 
EBI UniRef50UniRef50_P259910.059.42%Protein suppressor of forked n=16 Tax=Coelomata RepID=SUF_DROME
NCBI RefSeqXP_393870.20.069.06%PREDICTED: similar to Protein suppressor of forked [Apis mellifera]
NCBI nr blastpgi|3407231130.069.32%PREDICTED: protein suppressor of forked-like [Bombus terrestris]
NCBI nr blastxgi|3407231130.069.32%PREDICTED: protein suppressor of forked-like [Bombus terrestris]
Group
Gene OntologyGO:00063971.9e-81mRNA processing
GO:00056341.9e-81nucleus
KEGG pathway 
InterPro domain[372-690] IPR0088471.9e-81Suppressor of forked
Orthology groupMCL14011 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204962-TA
ATGAATGATGAAAATGCAGAGATTGACTGGGGTAATGAGAGGTTAAGCCGAGCCCAACGCGCGGTTGAAGCGAACACGTATGATGTTGATTCGTGGTCTCTCTTGATACGCGAAGCCCAAACCCGGCCTATAAATGAGGTCAGAACGATGTATGAGAAGCTCATTACAGCTTTCCCAACAACAGGGAGGTATTGGAAGATTTATATCGAACAGGAGATGAAAGCGAGAAATTTTGAGAAGGTCGAAAAGTTGTTTCAGAGGTGTCTCATGAAGATTCTAAACATTGAACTGTGGAGGCTATACCTAAACTACGTTAAGGAGACCAAGTGCATGTTGCCGACATACAAAGAGAAAATGGCGCAGGCCTACGACTTCGCGTTGGACAAAATAGGTCTGGACATACACGCGTATCCTATATGGAACGATTACGTAACATTCTTGAAGGCTGTCGAGGCTGTTGGCTCTTACGCTGAGAACCAGAAGATATCAGCCGTTAGGAAGGTATACCAGAGAGCGGTCATTACCCCGATAATAGGTATTGAGACGCTGTGGAAAGACTACATCGCTTTCGAACAAGGAATCAACACTATCATAGCTGAGCGTATGGCTATGGAGCGATCACGGGAATATATGAACGCGAGGAGAGTAGCCAAAGAATTGGAGACGGTCACCAGGGGCTTGAATAGGAACATGCCGGCCACCCCGCCCACCGCAGACAGGGAGGAGATGAAGCAGGTGGAGTTGTGGAAGAAGTACATATCCTGGGAGCGCTCCAACCCCCTCAGGTCGGAGGATACCGCTCTCGTGGCCAGGCGGGTGATGTTCGCTATAGAACAGTGTCTGCTATGTCTGGCCCACCACCCGGATGTATGGCATCAGGCGGCGCAGTTCCTCGACCATTCATCTAAATTACTGCAAGAAAAGGGGGATTCGACAGCCGCCCGTCTGTTCTCCGAGGAGGCCGGTGCAGTCTACGAGAGAGCCACATCCGGTCCGCTCAAACATTCCACCTTACTGCACTTCGCTCACGCCGACTATGAGGAGAGTCGGCTGCATTACAACAAGGTACACCAGGTATACACTCGCTATCTGGATATGGCGGACATCGAACCCACGCTGGCCTACGTTCAATATATGAAGTTTGCGAGACGAGCTGAAGGTATCAAGTCGGCTAGGACGGTGTTCAAACGAGCCAGGGAAGACCCGAGATCCCGTTACCACGTGTTCGTGGCGGCCGCCCTCATGGAGTACTACTGCTCCAAAGACAAGAACATTGCTTTCAGGATATTTGAGTTGGGCCTCAAGAAGTTCTCCCACATTCCGGAGTATGTGTTGTGCTACATCGACTACCTGTCACATTTGAACGAGGATAACAACACCCGCGTGTTGTTCGAGCGCGTCCTGTCATCTGGATGTCTGAAGCCGGAGAGTTCTGTTGATATCTGGAATAGATTCCTGGAATTTGAATCCAATATTGGGGACCTCGTCAGTATAGTGAAGGTTGAGAAACGGAGGCAGGCGGTTCTGGAAAAGATCAAAGAGTTTGAGGGTAAGGAGACGGCTCAGCTCGTGGACAGATACAAGTTCCTGGATCTCTACCCCTGTACTATAGCTGAACTCAAGTCCATAGGATACACAGAGGTAGCATCAATGTCGAACAAGTCCTGGGCTCTCGGAGGACCGCTTGCTGGCATCTCACCAGAATTGGCCGCTGTGATACTAGGACAGAAAGACAATGATCCGAACAAGGACATAGTTCGTCCAGACACAAGTCAAATGATACCCTACAAGCCAAAATCGAACCCACTCCCCGGAGAGCATCCTATACCAGGTACGTACAAAGACAATGATCCGAACAAGGACATAGTTCGTCCAGACACAAGTCAAATGATACCCTACAAGCCAAAGTCCAACCCACTCCCCGGAGAGCATCCTATACCGGGCGGTTCCTTCCCGCTGCCTCCCGCGGCCGCCGCCCTGTGCACGGCGATGCCTCCCCCCTCCAGCTACAGAGGACCCTTCGTGGCGGTGGACATGCTGATAGCACTCTTCAACAGGATCACACTACCCGACAAACCCGCTGCCCCGACCAATGAGAACGGCTGTGACACCAAACTGTTTGAGCTGGCTCGCTCCGTCCACTGGATCATGGACGATGACACCACCAAGAATAATACGGCTCGCAGAAGGAAGCTGGGTTCGGACTCGGATGACGACGAGCTGGGCGCGCCGCCGCCTCTCAACGACGTCTACAGACAGAGACAACAGAAGAGAGTCAAGTGA

Protein sequence:

>DPOGS204962-PA
MNDENAEIDWGNERLSRAQRAVEANTYDVDSWSLLIREAQTRPINEVRTMYEKLITAFPTTGRYWKIYIEQEMKARNFEKVEKLFQRCLMKILNIELWRLYLNYVKETKCMLPTYKEKMAQAYDFALDKIGLDIHAYPIWNDYVTFLKAVEAVGSYAENQKISAVRKVYQRAVITPIIGIETLWKDYIAFEQGINTIIAERMAMERSREYMNARRVAKELETVTRGLNRNMPATPPTADREEMKQVELWKKYISWERSNPLRSEDTALVARRVMFAIEQCLLCLAHHPDVWHQAAQFLDHSSKLLQEKGDSTAARLFSEEAGAVYERATSGPLKHSTLLHFAHADYEESRLHYNKVHQVYTRYLDMADIEPTLAYVQYMKFARRAEGIKSARTVFKRAREDPRSRYHVFVAAALMEYYCSKDKNIAFRIFELGLKKFSHIPEYVLCYIDYLSHLNEDNNTRVLFERVLSSGCLKPESSVDIWNRFLEFESNIGDLVSIVKVEKRRQAVLEKIKEFEGKETAQLVDRYKFLDLYPCTIAELKSIGYTEVASMSNKSWALGGPLAGISPELAAVILGQKDNDPNKDIVRPDTSQMIPYKPKSNPLPGEHPIPGTYKDNDPNKDIVRPDTSQMIPYKPKSNPLPGEHPIPGGSFPLPPAAAALCTAMPPPSSYRGPFVAVDMLIALFNRITLPDKPAAPTNENGCDTKLFELARSVHWIMDDDTTKNNTARRRKLGSDSDDDELGAPPPLNDVYRQRQQKRVK-