Monarch geneset OGS2.0

DPOGS209769
Transcript         DPOGS209769-TA   2319 bp
Protein            DPOGS209769-PA   772 aa
Genomic position   DPSCF300397 - 94328-100767
RNAseq coverage    175x (Rank: top 50%)
Annotation
Heliconius         HMEL014896         0.0      71.61%
Bombyx             BGIBMGA010283-TA   0.0      71.45%
Drosophila         sip1-PA            3e-144   37.17%
EBI UniRef50       UniRef50_A1XDB3    0.0      65.02%   STIP n=2 Tax=Obtectomera RepID=A1XDB3_BOMMO
NCBI RefSeq        NP_001091840.1     0.0      65.02%   septin and tuftelin interacting protein [Bombyx mori]
NCBI nr blastp     gi|148298831       0.0      65.02%   septin and tuftelin interacting protein [Bombyx mori]
NCBI nr blastx     gi|148298831       0.0      65.27%   septin and tuftelin interacting protein [Bombyx mori]
Group
Gene Ontology      GO:0005634   7.4e-52   nucleus
                   GO:0003677   7.4e-52   DNA binding
                   GO:0006355   7.4e-52   regulation of transcription, DNA-dependent
                   GO:0003700   7.4e-52   sequence-specific DNA binding transcription factor activity
                   GO:0005622   4.3e-16   intracellular
                   GO:0003676   4.3e-16   nucleic acid binding
KEGG pathway
InterPro domain    [369-619]   IPR022783   7.4e-52   GC-rich sequence DNA-binding factor domain
                   [3-93]      IPR022159   6.2e-24   Tuftelin interacting protein N-terminal
                   [137-183]   IPR000467   4.3e-16   D111/G-patch
Orthology group    MCL12782   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209769-TA
ATGTCTGATGATGAGGTTATACGTTTTGAAATCACCGACTACGATTTGGATAATGAATTCAATCCCAACAGAAGTCGGAGGGCTAAGAAGGAACACCAAATATACGGTGTTTGGTCGAAAGATAGTGATGACGAGGAAAATGAAGACAATATAAGGCGACGTATACGTAAACCGAAAGATTTTTCAGCTCCAATAGATTTTGTAACTGGTGGAGTGCAGCAGGCCGGCAAGAAGAAGGATGAAAAGCAAGACATACAAAAATCGGAGTCGTCTACATCTCGTCCCAAATTTGCGGATAGTTCTGATGATGAAGTTTTGGAACCGGAAGCGCGGGAGACTGCGGGGATAAGAAAAGCCGGACAGGGTTTGAGATCTGGACAAAATTTAGGTGGAGTTGGTGCTTGGGAGAGACATACTAAAGGCATTGGAGCTAAATTATTATTACAGATGGGGTATCAACCTGGTAAGGGTTTGGGTAAAGAGCTGCAAGGTATCTCCGCTCCCGTAGAAGCTACAGTCAGGAAGGGCAGAGGTGCTATTGGGGCATATGGACCTGAAAAGGCTGCGCAAAAAGCTAAAAAGGAAGAACAGAAGCGGCTGAAAGAAAAAGAGGGAGATAAAAGTACTACAGAAAAGAGTTATAACTGGAAGAAATCACATAAGGGCAGATACTTCTACCGAGATGCAGCCGATGTCATACAAGAGGGTAAACCCACCATGCATACTATTAGCAGTAACGAGCTGGCCCGCGTGCCGGTTATAGACCTGACCGGCCGGGAGAAGAAAGTATTGAGCGGCTACCACGCCCTGCGAGCCGCCGCGCCGCGGTATGAACACGAACCCCGGAGGGAGTGCACTAACTTCGCAGCACCAGAACTCACTCACAACTTGCAGCTGATGGTGGATTGTTGTGAACAGGACATTATCCAAAACGCTCGCGAACTCCAACAGTCTGAGGACGAGATCGTGGTGATAGAGCGTGATCTCGAAGACTGTAAGATAAAGTTAGGTGAAGAAGATGACGTCATCAGGACGCTGGAAGGCATACTGGCGAGGGTGGAGGTACTGAACAGGCCGGACGCCTCGCTCGAGATGGGCTATGAGGTGCTGAGGGATCTCAAGGAAACATACCCCTTGGAATATGAGATGTTCAGTCTGGGTAATATAGGGGGTAACATAGTGAGTCCCCTATTCAGTGCCATGATGGCCTCGTGGAGTCCCCTGACGGACCCCGGGGGCGTAGCACCCGTGTTCCTCAAGTGGAGGCCCCTTCTGACGGAAGAGGCCTACAATAATCTTCTATGGCAACATTTTGGAACTAAAATCGAAATGACCGTTGAAGAGTGGAATCCTCGCAATCCGGAGCCTATGGTCCTCGTGTTCAAGTCGTGGGTGTCCGTGTGTCCGTCCTGGCTGGTGAGCTCGTGTGTCACGAGATACGTCGCCCCCCGCCTGGTGACCGCGGTCCGGGACTGGGATCCCACGGGAGATACGCAGCCCTTACACCAGTGGGTGCTGCCCTGGCACGAGTTCGCTGGGGAAGCTCTCAACGCTTCCGTGTATCCGTTGATTCGTTCCCGGCTGTCGTCAGCGCTGGCTGCCTGGCACCCCGCGGACTCGTCGGCCAGGCCGCTGCTGGCGGTTTGGCGAAGTTCGTGGGGCCCCGCTCTCACGACCCTCTTACATCATCACATCGTACCTAAACTGGAGCACTGTTTGCAGAACGCTCCTTTGGAACTCGTAGGAAGGGAGAATACCGCGTGGCTCTGGTGTGTGGATTGGGTGGAGTTGATCGGCGCCCCGACAATAGCCAGCCTGGCGGGTCGAGCTCTAATGCCGCGCTGGTTGGCGGCATTGGCCGCTTGGCTGAACACTTCCCCACCGCACGCCACTGTACTAAACTCGTACACGGAGTTCAAGAAAATGTTCCCGGAAGACGTCCTCAAAGAACCGCCCGTGCGTGACGCGTTCCGCAAAGCCTTGGACATGATGAACAGGAG
TACGGACATAGATTCCATAGAGCCGCCCCCACCGCCGCGCTTCACCATACCAGAACCGAAAGAATCTTCCAGGATAAGCGACGTCCTGGCAACGATAACGCAACAGAAGAGCTTTTCAGAACTGCTCGAATCCAGGTGCATAGAACGCGGCATCACTTTTGTGCCTATAGTGGGCAAGAGTAGGGAGGGCAGGCCGTTGTATAAGATCGGTGAACTGCAGTGTTACGTCATCAGGAACGTCATCATGTACTCCGATGATGGCGGACGGAGCTTCGGGCCCATCGGCCTCGATAGGCTCCTGAGTTTGGTTGAGGATTAA

Protein sequence:

>DPOGS209769-PA
MSDDEVIRFEITDYDLDNEFNPNRSRRAKKEHQIYGVWSKDSDDEENEDNIRRRIRKPKDFSAPIDFVTGGVQQAGKKKDEKQDIQKSESSTSRPKFADSSDDEVLEPEARETAGIRKAGQGLRSGQNLGGVGAWERHTKGIGAKLLLQMGYQPGKGLGKELQGISAPVEATVRKGRGAIGAYGPEKAAQKAKKEEQKRLKEKEGDKSTTEKSYNWKKSHKGRYFYRDAADVIQEGKPTMHTISSNELARVPVIDLTGREKKVLSGYHALRAAAPRYEHEPRRECTNFAAPELTHNLQLMVDCCEQDIIQNARELQQSEDEIVVIERDLEDCKIKLGEEDDVIRTLEGILARVEVLNRPDASLEMGYEVLRDLKETYPLEYEMFSLGNIGGNIVSPLFSAMMASWSPLTDPGGVAPVFLKWRPLLTEEAYNNLLWQHFGTKIEMTVEEWNPRNPEPMVLVFKSWVSVCPSWLVSSCVTRYVAPRLVTAVRDWDPTGDTQPLHQWVLPWHEFAGEALNASVYPLIRSRLSSALAAWHPADSSARPLLAVWRSSWGPALTTLLHHHIVPKLEHCLQNAPLELVGRENTAWLWCVDWVELIGAPTIASLAGRALMPRWLAALAAWLNTSPPHATVLNSYTEFKKMFPEDVLKEPPVRDAFRKALDMMNRSTDIDSIEPPPPPRFTIPEPKESSRISDVLATITQQKSFSELLESRCIERGITFVPIVGKSREGRPLYKIGELQCYVIRNVIMYSDDGGRSFGPIGLDRLLSLVED-
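
Consistency check (illustrative sketch, not part of the original record):

The transcript and protein entries above can be cross-checked against each other. The sketch below assumes the transcript is a plain coding sequence translated with the standard genetic code, and that the trailing "-" in the protein entry marks the stop codon. The function names `translate` and `expected_cds_length` are hypothetical helpers introduced here for illustration only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence; stop codons are written as stop_symbol."""
    # Remove whitespace so a sequence wrapped over several lines still works.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(n_residues):
    """Length in bp of a CDS encoding n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


# First eight codons and last four codons of DPOGS209769-TA, copied from above.
print(translate("ATGTCTGATGATGAGGTTATACGT"))  # MSDDEVIR
print(translate("GTTGAGGATTAA"))              # VED-
print(expected_cds_length(772))               # 2319
```

Under these assumptions the 772 aa protein plus one stop codon accounts for the full 2319 bp transcript, and the translated start and end of the transcript match the start (MSDDEVIR) and end (VED-) of the protein entry.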