Monarch geneset OGS2.0

DPOGS215846
TranscriptDPOGS215846-TA4374 bp
ProteinDPOGS215846-PA1457 aa
Genomic positionDPSCF300073 + 710279-725738
RNAseq coverage204x (Rank: top 47%)
Annotation
HeliconiusHMEL0074570.088.51% 
BombyxBGIBMGA002946-TA0.084.59% 
Drosophilasalm-PA2e-7160.39% 
EBI UniRef50UniRef50_D6WL630.060.67%Spalt n=5 Tax=Tribolium castaneum RepID=D6WL63_TRICA
NCBI RefSeqXP_973229.20.061.76%PREDICTED: spalt-like protein [Tribolium castaneum]
NCBI nr blastpgi|1892373980.061.76%PREDICTED: spalt-like protein [Tribolium castaneum]
NCBI nr blastxgi|2700081620.052.34%spalt [Tribolium castaneum]
Group
Gene OntologyGO:00036766.7e-13nucleic acid binding
GO:00082707.1e-05zinc ion binding
GO:00056227.1e-05intracellular
KEGG pathway 
InterPro domain[491-523] IPR0130876.7e-13Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL16643 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215846-TA
ATGAGCCGGGCGCGGCTCCGCCTCCTTGTCATAACCTTATACCTGGTGCGCGCGCGCGACTCCCGCCGTCCTCCACCACCCGCCACCACCGCCCGGAGGCTCGCGGCTCGCGCTCAGTCCCGCGTCAGATGGCGAGAGGTGAGCCCGTCTGTATACCGCGTCCGATCACACAGGCTCCGCGCGAGTGAGTACTCTACCACCGCCGGCGATGCGAGGGCTGTGAGCCGCCAGCGCTACCACCGACCACCTACCACCGACCGACCACCGGCCAGCATGCCGCGCGTCAAGCCAGCCTGCGTCCGACGCGTCTCCATCGGTACATATCCATCAGCTCGATCACCAAACCGACACACGACCGTACATAGGATCTGTGATAGCTCGGGATCCTGTTCAGAAGAGGATATAAATAATGTGTTACCCGATGACGGAAGAGATCGGCCTGAGCCACATATGTGCCCTCGCTGTCATGAACAGTTCGAGAACCTACACGACTTCTTGTATCACAAGAGGGTTTGTGATGAAAAAGCAATACAAATGGGTGAAGAAAGAATGCACTCTGATCCCGAGGATATGGTAGTGTCGGGGGATGAAGAAATGGATGGACCCAATAAGCGGTTAGATCAAGTAAGAAGGCATCGGCAGGATGCTGAAAATAATAACAGCCTAGAGGATGGTGAAGCTGAGGTACCTGAAGCCGATATGCCCCCCGTAGGTCTTCCCTTCCCCGTAGCAGGCCATGTTACTCTTGAAGCTCTTCAGAATACAAAAGTAGCTGTTGCTCAATTCGCTGCGACGGCAATGGCTAATAATGCAAATAATGAAGCCGCTTTACATGAGCTTGCCGTATTACAAAGTACGCTATTTACTCTACAGCATCAACAAGTATTTCAACTTCAGTTAATACGCCAACTACAAAATCAATTGTCATTAACACGACGGAAAGAGGATCAACATTCAAGCCCGCCTCCGAGTGAAATTGAACAAAACGCTCCGTCGACGCCAGCTCGCTCGCCTTCACCACCGCGTCCGCCACGGGAGCCATCTCCTGCTGAACCCACTCATCCTACTAGCCAAAATTTGCCGTCTACCCACACAAATCTTACGCCCAAAACGGAACCTATCTCTGTTCCCAAGCTTCCGACTTCATCGCCACCAATGATGTCCCACCCACCTTACAGCTCCATATCTTCCTCTCTAGCATCTTCAATCATTACAAACAACGATCCTCCGCCATCTCTTAATGAACCAAACACTCTCGAAATGCTTCAAAAACGTGCACAAGAAGTACTAGATAACGCATCACAAGGCCTTCTCGCTAATAATCTTGCCGATGAACTAGCTTTTAGAAAATCTGGAAAAATGTCACCCTATGACGGAAAATCTGGTGGACGCAACGAACCATTCTTTAAACATCGCTGTCGATATTGTGGAAAAGTTTTTGGAAGTGACTCAGCGCTTCAGATTCATATAAGATCTCATACAGGCGAAAGACCATTTAAATGTAACGTTTGTGGATCTCGCTTTACAACGAAAGGAAACCTTAAAGTACATTTCCAAAGACATACTTCTAAATTTCCACATGTTAAGATGAATCCCAACCCTGTACCCGAACATTTAGACAAGTATCATCCACCACTACTAGCACAACTATCACCGGGGCCGATACCGGGAATGCCACCCCATCCACTTCAATTCAATCCAGGTGCACCAGCACCTTTTCCACCAAGCTTGCCATTATATAGGCCACCACACCATGATTTACTTCCACCTCGACCTCTTGGAGATAAACCTCTTTCACATCATCCACTTTTCGCGATGCGAGAAGAACAAGACGTACCAGCAGATCTCAGCAAACCATCTGCACCAAGTCCTACTCAGCCTGCATCTGAGGTTTTTAAATCCGAACCACAAGATGAAGAAAGCCAACGTGACTCTAGCTTTGACGAGTCTGATCGCATATCACCTAAAAGAGAACCTGAAGAAAACGATGCTGGACAAGATCAAGAACACGATCGATATCCATCAACATCGCCCTATGATGACTGCAGCATGGATTCAAAATACAGCAACGAAGACCAAATCGGCAGAGAAAGTCCACAAGTGAAGGCTGATCCAGACCAACCGGAAAATCTTTCAAGTAAGAATTCTACGATATCTGGACCAATTTCAATAGCAACGGGACTTCGTACTTATCCTTCTTATCCATTGTTTCCACAATCCCCACCTAGCAGCGTGTCTTCTGAGAGTCTTACTCCGTTTTCAAATAATCCTGTCCTTGGAGACACTGATGTAACACGAGACCCCATATTTTATAATTCGCTCTTACCGCGTCCAGGTAGCAACGATAATTCTTGGGAAAGTTTAATAGAAATTACAAAAACATCTGAAACCTCCAAATTACAGCAACTAGTTGATAATATTGATAACAAAGTATCTGAGCCTAATGAATGCATAGTTTGTCACCGTGTTCTGTCTTGTAAAAGTGCTTTACAAATGCACTACCGCACACATACTGGTGAGCGCCCATTTCGATGTAAATTATGCGGTCGTGCTTTTACCACAAAAGGTAATTTGAAGACACATATGGGTGTACACCGCATAAAACCTCCCTCACAATTATTGCATCAATGTCCCGTGTGCCATAAAAAGTTTACAGATCCTTCTATGTTACATCAACATATCAGAATTCATACGGGTGAACGACTCAATAACCCTTTCAATGAAGTTAACGACAATAACGCGAATAGTTGTCAGTCTTATAACAACGAATCGGACATTACAGACTGTTCTTATCGTCCCATTCCGGCTCCAATTTTCCCTACACCTTCCACTCCCGGCGACCGAAGGGCGGACTCCCGCGGGACCGACGATGAGAGCGGCAGAGATTCTCGGGAGTTCGATGAGGACTCAGATATAAAAAATCGTCGAGCTTCGCCGCTCTCCGTCTGCGCCTCGGCGTCCGAATGCGAAGTAAAAACCATCACCACAACGGCTTCCCTCCCATCGGCGACAGGTTCGGAGAGCGGGCGCAGTGCACGGGCCTCCCCACCGTCGCCGTCTCCGTCGGTGCTGTCGACGCCCCCGCGGCTGCCGCAGCACTCGCCGCTGCCTTCGCCGCCGACCCCGCTCGCCGCGCTCGGTGCTCTCGGTGGACCTTTCAGCCCGCTCGGACTCGCTTTCCCTCCCGCAGTGCGCGGAAATACAACATGTACTATCTGCTACAAGACTTTCGCCTGCAATTCGGCGTTGGAAATCCATTACCGCAGTCACACTAAAGAGCGACCATTCAAATGCACCGTCTGCGACAGAGGCTTCTCTACTAAGAGCAGTGGCGGCGGTTGCCGGTGTAGGCGCCCACGCGCACCCCGCCCGCCGCACGCCACTGCTTTGGACCTCTGGAACGCCTTCGTCTACCCGGGAAACATGAAACAGCACATGTTGACGCACAAGATACGTGACATGCCGCCTGGTTTCGAAAAGAGCGTAAGTGGACCAACTGGTCCGCCAAGTGAAGAAGGCCGGGAACCAAGTCCAGACAGACGATCGTCTCCTGAGAAGATCGATTTGAAACGATCACCCCCCGCCTTACCCCCCGTCACTATAGCGCACCCACCCCCCATTGATATGCCACCACTGCCTAAAAGACCTACAGTACCTATAGTACCAAATCATCCGCCACCATCTCAGTCATCGAAGCACCTTTGCGGAGTTTGTCGCAAAAACTTCTCATCGTCATCGGCTTTACAAATCCACATGCGCACGCACACCGGAGACAAACCCTTCCGATGCGCTGTTTGCCAAAAAGCTTTTACCACCAAAGGAAACCTTAAGGTTCACATGGGCACACACATGTGGAGCGGAGGTGCGTCTCGTCGCGGCCGTCGTATGTCTCTGGAACTGCCCCCACGGCCGCTACATGAGCCTCACGACCTTCTCCGACGACCTGATCTCTTCTACCCCTACTTGCCTGCGCCATTTCTCAATGGCATGCAACAAAAACTGAACGAGATATCAGTAATACAACAACAAAATGCCGGCCAGAATGGCGTAGCTGGTAAATTCCCGGGACTACTTGGTTTCGGAGCTTTTGGAGCTGGAAGACCGGGTGCCGCCTCCCCTCTCGAGAGGCCACCGTCACTCGACGGTGATGAACGTCAGGCGGCGATGAGGGAACTCGCTGAGAGAGGCAGGGAGCTAGCTGAACGGAGCCGACAGATGCGTGAGGAAAATGAACGGGAGCACTACCGGTCCGCTGGCGGTCATCCAGCGCATGCGCCACCGACAGCACAGGCTTCGCCTCCAGCACCACATAATCTACCGCATCCCCTGGCATCAATACCGCCGCCCGCGCGCACCGAAGGTCTCACAGTTTAA

Protein sequence:

>DPOGS215846-PA
MSRARLRLLVITLYLVRARDSRRPPPPATTARRLAARAQSRVRWREVSPSVYRVRSHRLRASEYSTTAGDARAVSRQRYHRPPTTDRPPASMPRVKPACVRRVSIGTYPSARSPNRHTTVHRICDSSGSCSEEDINNVLPDDGRDRPEPHMCPRCHEQFENLHDFLYHKRVCDEKAIQMGEERMHSDPEDMVVSGDEEMDGPNKRLDQVRRHRQDAENNNSLEDGEAEVPEADMPPVGLPFPVAGHVTLEALQNTKVAVAQFAATAMANNANNEAALHELAVLQSTLFTLQHQQVFQLQLIRQLQNQLSLTRRKEDQHSSPPPSEIEQNAPSTPARSPSPPRPPREPSPAEPTHPTSQNLPSTHTNLTPKTEPISVPKLPTSSPPMMSHPPYSSISSSLASSIITNNDPPPSLNEPNTLEMLQKRAQEVLDNASQGLLANNLADELAFRKSGKMSPYDGKSGGRNEPFFKHRCRYCGKVFGSDSALQIHIRSHTGERPFKCNVCGSRFTTKGNLKVHFQRHTSKFPHVKMNPNPVPEHLDKYHPPLLAQLSPGPIPGMPPHPLQFNPGAPAPFPPSLPLYRPPHHDLLPPRPLGDKPLSHHPLFAMREEQDVPADLSKPSAPSPTQPASEVFKSEPQDEESQRDSSFDESDRISPKREPEENDAGQDQEHDRYPSTSPYDDCSMDSKYSNEDQIGRESPQVKADPDQPENLSSKNSTISGPISIATGLRTYPSYPLFPQSPPSSVSSESLTPFSNNPVLGDTDVTRDPIFYNSLLPRPGSNDNSWESLIEITKTSETSKLQQLVDNIDNKVSEPNECIVCHRVLSCKSALQMHYRTHTGERPFRCKLCGRAFTTKGNLKTHMGVHRIKPPSQLLHQCPVCHKKFTDPSMLHQHIRIHTGERLNNPFNEVNDNNANSCQSYNNESDITDCSYRPIPAPIFPTPSTPGDRRADSRGTDDESGRDSREFDEDSDIKNRRASPLSVCASASECEVKTITTTASLPSATGSESGRSARASPPSPSPSVLSTPPRLPQHSPLPSPPTPLAALGALGGPFSPLGLAFPPAVRGNTTCTICYKTFACNSALEIHYRSHTKERPFKCTVCDRGFSTKSSGGGCRCRRPRAPRPPHATALDLWNAFVYPGNMKQHMLTHKIRDMPPGFEKSVSGPTGPPSEEGREPSPDRRSSPEKIDLKRSPPALPPVTIAHPPPIDMPPLPKRPTVPIVPNHPPPSQSSKHLCGVCRKNFSSSSALQIHMRTHTGDKPFRCAVCQKAFTTKGNLKVHMGTHMWSGGASRRGRRMSLELPPRPLHEPHDLLRRPDLFYPYLPAPFLNGMQQKLNEISVIQQQNAGQNGVAGKFPGLLGFGAFGAGRPGAASPLERPPSLDGDERQAAMRELAERGRELAERSRQMREENEREHYRSAGGHPAHAPPTAQASPPAPHNLPHPLASIPPPARTEGLTV-