Monarch geneset OGS2.0

DPOGS201993
Transcript: DPOGS201993-TA, 2004 bp
Protein: DPOGS201993-PA, 667 aa
Genomic position: DPSCF300060 + 153683-159223
RNAseq coverage: 1134x (Rank: top 11%)
Annotation
Heliconius: HMEL005614, E-value 0.0, 83.86%
Bombyx: BGIBMGA010562-TA, E-value 0.0, 78.79%
Drosophila: Cka-PE, E-value 0.0, 52.19%
EBI UniRef50: UniRef50_E2APP1, E-value 0.0, 53.22%, Striatin-3 n=14 Tax=Formicidae RepID=E2APP1_CAMFO
NCBI RefSeq: XP_001601585.1, E-value 0.0, 54.34%, PREDICTED: similar to striatin [Nasonia vitripennis]
NCBI nr blastp: gi|332031514, E-value 0.0, 54.28%, Striatin-4 [Acromyrmex echinatior]
NCBI nr blastx: gi|189236270, E-value 0.0, 55.88%, PREDICTED: similar to AGAP008755-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, E-value 3e-63, protein binding
KEGG pathway 
InterPro domain:
[338-666] IPR015943, E-value 3e-63, WD40/YVTN repeat-like-containing domain
[346-666] IPR011046, E-value 4e-61, WD40 repeat-like-containing domain
[38-163] IPR013258, E-value 2.2e-42, Striatin, N-terminal
[582-621] IPR001680, E-value 2.2e-08, WD40 repeat
[584-621] IPR019781, E-value 2.7e-08, WD40 repeat, subgroup
[373-387] IPR020472, E-value 7.9e-07, G-protein beta WD-40 repeat
Orthology group: MCL10720 (Multiple-copy universal gene)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201993-TA
ATGGATGATAGCTCCGTATCTCATCATAACGGAGGGCAAGTGGGATCTCAGATAGGAGTTAGTGTAAATAACAAACAAAATGAAGAATCTAGTCAAAGTGTCCAGTATTCAATACCGGGCGTTTTGCATTTCATCCAACATGAATGGGCGAGATTTGAAGTCGAAAGATCACAATGGGAGGTGGACAGAGCAGAGTTTGAGGCACGGATTGCCTTCCTCCAAGGTGAGAGGAAGGGTCAAGAGAATCTAAAAAATGATTTGGTAAGGCGGATCAAAATGCTTGAGTATGCACTTAAGCAGGAAAGAGCGAAGTTTCATAAGCTGAAATATGGAGTAGATCTTCAGCAGAGGGACATGCATCCACCAGCTGAAGAGCCTACACAGGAACCAGAGCCTTCAGAGCGCGCCCAATGGAAGCAGGGTAGACAATTGATTAAACAGTACCTACAGGAAATTGGTTACACAGATACCATCCTTGATGTTCGTTCAAACAAAGTAAGAACTTTGCTTGGTTTAAACAATGAGGAGCCGGCCGATGATAGTAGCCACAGAAACTGTGACAAACAACAACACTATTCTATGAATACTGCCTTAAATCGTAAGATGTATGAATATGGTGGAAAAGATCAACGCAAAGGAGCTCAGCAAGGCAGCGGTTATAATGAAGAAGGGATGTCCGTTCAAGAGACAGCGGCTGTGTTTGCTAACTTTGAGTTTCTATCTAATCAGGAAATGGATATGGATGAAATTGATGACTTAGATGCCAAACAAATGCACCACACTGCAACTAAGCAAGGTGAAGAGGTGGACCAAGAAGCGGAGGAAGTATTAAATGAACTAAACCTGCTAACTGAGAGCGAGGCTGACGGAGGACAAGGAGATGAGTTCCCAACCGTGAAATTCCCAGCCTCTTCAGCGACAGCGGCGCGTGAGGGTGACGGGAAGCCGCTAGCTCTGGGAGAGCTGGCACAGCTGACGGTCAGCAACGAGTCCGAGGGTTACGACGTGGCCAGCGCTAGCAAGGAGTCCTTCAGGAAGACCTGGAACGCTAAATACAGTCTGCGTTCACACTTTGATGGGGTGCGTGCTCTCGCCTTCCATCCGACGCGGGCTGCTTTGGTGACAGCCTCCGAGGATCACACGCTGAAGATGTGGGATCTCCAGCGTACTGTCCCAGCTAAGAAGTCGGCCGGTTTGGATGTGGAACCTCTGTATACGTTCAGAGCCCACACAGCGCCGGTGCTCTGCCTCGCTATGGGCGCTCCCCGCTCGGAGGAATGTTTCTCAGGGGGTCTGGACGGGACCATCAGAGTTTGGAACCTGCCCCCACCCATCGCTGATCCATACGACTCATATGATCCCGCTGTATTGGGGCCTGTACTCCGCGAGCACACGGACGCGGTGTGGTCGCTGTCGTGCGCTGGTGGTAGGCTGTTGTCTTCATCAGCGGACGGTACAGCCAGGCTGTGGTCCCCACGAGACCCTCGGCCCCTGCTCGCCACGCTCAGGGACGACTCCCCCGGCTCCACCTCAGCGCCGGCGGCCGCGGACTTCGCTGATGCAGCCTGCCGCGCGGCAGTCGTCTACACTGACGGCACTCTTCTATTATACGACCTCGAAACTGGACAGGTGGTGTTGCGCGTGCCGTGCGACAGTCCCGCCAACCGCGTCCGCTCCCATCCAACTCTGCCGCTCCTTGTGACGGCTCACGAGGATCGTCACATCCGTTTCTGGGACGCCGTTTCCGGTCGCTGTGCGCACGCGATGGTCGCCCACCTGGACGCCGTGACGGGTCTGGCACTGGATCCGAACGGCCTGTTCCTGCTGTCGGGCTCCCACGATTGTTCCGTGCGCCTGTGGAACCTGGACACCAAGACCTGCGTGCAGGAGATCACTGCTCACAGGAAGAAGTTCGACGAGAGCATCCTGGATGTGGCCTTCCACCCCCTCCGGCCGTACATCGCCAGCGCCGGCGCCGACGGCCTCGCTAAAGTCTTCGT
CTGA

Protein sequence:

>DPOGS201993-PA
MDDSSVSHHNGGQVGSQIGVSVNNKQNEESSQSVQYSIPGVLHFIQHEWARFEVERSQWEVDRAEFEARIAFLQGERKGQENLKNDLVRRIKMLEYALKQERAKFHKLKYGVDLQQRDMHPPAEEPTQEPEPSERAQWKQGRQLIKQYLQEIGYTDTILDVRSNKVRTLLGLNNEEPADDSSHRNCDKQQHYSMNTALNRKMYEYGGKDQRKGAQQGSGYNEEGMSVQETAAVFANFEFLSNQEMDMDEIDDLDAKQMHHTATKQGEEVDQEAEEVLNELNLLTESEADGGQGDEFPTVKFPASSATAAREGDGKPLALGELAQLTVSNESEGYDVASASKESFRKTWNAKYSLRSHFDGVRALAFHPTRAALVTASEDHTLKMWDLQRTVPAKKSAGLDVEPLYTFRAHTAPVLCLAMGAPRSEECFSGGLDGTIRVWNLPPPIADPYDSYDPAVLGPVLREHTDAVWSLSCAGGRLLSSSADGTARLWSPRDPRPLLATLRDDSPGSTSAPAAADFADAACRAAVVYTDGTLLLYDLETGQVVLRVPCDSPANRVRSHPTLPLLVTAHEDRHIRFWDAVSGRCAHAMVAHLDAVTGLALDPNGLFLLSGSHDCSVRLWNLDTKTCVQEITAHRKKFDESILDVAFHPLRPYIASAGADGLAKVFV-
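
The transcript and protein entries in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein sequence marks the stop codon. The names translate and expected_cds_length are hypothetical helpers introduced here for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_cds_length(protein_length):
    """Coding length in bp for a protein plus one stop codon."""
    return 3 * (protein_length + 1)


# First 15 and last 12 bases of DPOGS201993-TA, copied from the record.
print(translate("ATGGATGATAGCTCC"))   # MDDSS
print(translate("GTCTTCGTCTGA"))      # VFV-
print(expected_cds_length(667))       # 2004
```

The translated fragments match the start (MDDSS) and end (VFV-) of DPOGS201993-PA, and 667 residues plus one stop codon account for the full 2004 bp transcript. The genomic interval 153683-159223 spans 5541 bp, longer than the transcript, which would be consistent with intronic sequence in the gene model.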