Monarch geneset OGS2.0

DPOGS206043
Transcript        DPOGS206043-TA   1551 bp
Protein           DPOGS206043-PA   516 aa
Genomic position  DPSCF300028 - 1274054-1284631
RNAseq coverage   270x (Rank: top 40%)
Annotation
Heliconius      HMEL008775        0.0     95.53%
Bombyx          BGIBMGA000716-TA  0.0     90.79%
Drosophila      ftz-f1-PB         5e-77   78.11%
EBI UniRef50    UniRef50_P49867   4e-160  88.89%  Nuclear hormone receptor FTZ-F1 n=8 Tax=Pancrustacea RepID=FTZF1_BOMMO
NCBI RefSeq     NP_001037528.2    8e-161  88.89%  nuclear hormone receptor FTZ-F1 [Bombyx mori]
NCBI nr blastp  gi|17979670       0.0     92.63%  nuclear hormone receptor betaFTZ-F1 [Manduca sexta]
NCBI nr blastx  gi|17979670       0.0     92.63%  nuclear hormone receptor betaFTZ-F1 [Manduca sexta]
Group
Gene Ontology   GO:0005634  3.3e-38  nucleus
                GO:0006355  3.3e-38  regulation of transcription, DNA-dependent
                GO:0043565  3.3e-38  sequence-specific DNA binding
                GO:0008270  3.3e-38  zinc ion binding
                GO:0003700  3.3e-38  sequence-specific DNA binding transcription factor activity
                GO:0003707  1.6e-18  steroid hormone receptor activity
                GO:0043401  1.6e-18  steroid hormone mediated signaling pathway
KEGG pathway 
InterPro domain  [42-517]   IPR016355  1.7e-130  Steroidogenic factor 1
                 [95-166]   IPR001628  3.3e-38   Zinc finger, nuclear hormone receptor-type
                 [94-161]   IPR013088  6.4e-31   Zinc finger, NHR/GATA-type
                 [336-378]  IPR008946  1.6e-18   Nuclear hormone receptor, ligand-binding
Orthology group  MCL12629  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species
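
The InterPro entries above give domain positions as bracketed residue ranges. A minimal sketch of how such ranges could be turned into lengths and subsequences follows. It assumes the coordinates are 1-based and inclusive, which the record does not state; the helper names `domain_slice` and `domain_length` are hypothetical and not part of any database tool.

```python
def domain_slice(protein, start, end):
    """Return residues start..end of a protein string.

    Assumption: coordinates are 1-based and inclusive, so residue 1 is
    protein[0] and the end position is included in the result.
    """
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    return protein[start - 1:end]


def domain_length(start, end):
    """Number of residues covered by an inclusive 1-based range."""
    return end - start + 1


# Ranges copied from the InterPro rows of this record.
domains = {
    "IPR001628": (95, 166),   # Zinc finger, nuclear hormone receptor-type
    "IPR013088": (94, 161),   # Zinc finger, NHR/GATA-type
    "IPR008946": (336, 378),  # Nuclear hormone receptor, ligand-binding
}

for accession, (start, end) in domains.items():
    print(accession, domain_length(start, end))
```

Passing the DPOGS206043-PA sequence as `protein` would return the residues for each range under the same coordinate assumption.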

Nucleotide sequence:

>DPOGS206043-TA
ATGACGATGGACCAGCAGACAGGCCTCATGTCTCTAAACATGTCCCCGTTCGACCTCAGTCCGGGCCCCGAAGGGTCCGGCTCGGGCGGTGGACCCTCGGGTGCCTCCCAGCAGTATGTGCCGCAGGGCGCCGCTTATCAATGCACATCAGACCAACAGCCTTTCGGCTACGCAAACCTGGATGCTTCATATCTCTTTCCAACAGGTGCCGGAAGCGAATCTGGAGCTTATTTACCAGCAGCCGGGACTGTCTGCGATCAAACCGACACCAAGGACGTCATAGAAGAACTCTGTCCCGTTTGTGGGGACAAGGTGAGCGGTTACCATTACGGGTTGCTCACTTGCGAGTCCTGTAAAGGTTTCTTCAAGAGGACCGTTCAGAACAAGAAAGTATACACCTGCGTCGCGGAACGCGCCTGCCACATTGATAAAACACAGAGGAAACGATGCCCATTCTGCCGCTTCCAAAAGTGTCTCGACGTCGGCATGAAACTAGAGGCCGTACGTGCGGATCGCATGCGTGGCGGACGTAATAAGTTCGGCCCCATGTACAAACGCGACCGCGCCAGGAAACTGCAAATGATGAGACAAAGGCAAATCGCTGTTCAAACGCTGCGGGGATCGCTCGGTGACAGCGGGATCGTGCTCGGTTTTAATTCACCCTACGCGTCTGTGCCAGTCAAACAGGAAATACAGATACCGCAGGTGTCGTCGCTGACGTCATCGCCCGAGTCGTCGCCTGGGCCGGCTTTGCTGGCCGCTCAGCCGCAGGCCGCCCAGCCTCCGCCTCCTCCTGCACACGACAAGTGGGAGACTCACTCTCCTCACTCGGCGTCCCCAGACGCGTTCGCGTTCGACGCGCCCGCCACCACGGCCGCCACGCCCTCCAGCACGGCCGAGCCCACCAGCACGGAAACACTGCGAGTGTCGCCCATGATACGAGAATTCGTTCAGACCATCGACGACAGGGAGTGGCAAAATTCACTGTTCGGACTCTTGCAGAGCCAAACCTACAATCAGTGTGAAGTGGATCTGTTCGAGTTAATGTGCAAAGTGCTGGACCAAAACTTGTTCTCTCAAGTGGATTGGGCGAGAAACACCGTGTTCTTTAAGTATCTAAAGAGTCGCAGCCAGTCACGTGGTTTTGCGACGTTGCCGCTCCGCGCATCCCGCCGTCTCGCTCCGTCAGACAACTTCCACACCACAACTAGCAATTTTCATTTGTTCCAAATGACGCGACTTGTTTACGACCCATTCTCGGAATTAGGCCGGCCAACGAAACGTGTACATGAATCTCGCGTTTTATCACTACCGAAGAAAAATATACGAAACGACGATAATTCCGATTCCACTCCCATTATAGATATGTTTCGGCCCGTTCCCTACCGTACTCGGCGGGATAGACGCGTGATGTTGATGACCTACGCTTTCTCCAGACGGATGTGTTGTAAGCGTTCGGGACGGCAACGCTGTACTTTCCTTGCCGCACTTGACACATTGCCCGTGGCCGAGTTGCCGCGAGAAAGCACTTTTTACCCGCGTAACCGCTAA

Protein sequence:

>DPOGS206043-PA
MTMDQQTGLMSLNMSPFDLSPGPEGSGSGGGPSGASQQYVPQGAAYQCTSDQQPFGYANLDASYLFPTGAGSESGAYLPAAGTVCDQTDTKDVIEELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNKKVYTCVAERACHIDKTQRKRCPFCRFQKCLDVGMKLEAVRADRMRGGRNKFGPMYKRDRARKLQMMRQRQIAVQTLRGSLGDSGIVLGFNSPYASVPVKQEIQIPQVSSLTSSPESSPGPALLAAQPQAAQPPPPPAHDKWETHSPHSASPDAFAFDAPATTAATPSSTAEPTSTETLRVSPMIREFVQTIDDREWQNSLFGLLQSQTYNQCEVDLFELMCKVLDQNLFSQVDWARNTVFFKYLKSRSQSRGFATLPLRASRRLAPSDNFHTTTSNFHLFQMTRLVYDPFSELGRPTKRVHESRVLSLPKKNIRNDDNSDSTPIIDMFRPVPYRTRRDRRVMLMTYAFSRRMCCKRSGRQRCTFLAALDTLPVAELPRESTFYPRNR-
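
The two sequences can be checked against each other. The sketch below assumes the standard genetic code and uses a hypothetical helper, `translate`, written for illustration only. It translates the first six and last three codons of DPOGS206043-TA and confirms that the stated lengths agree: 516 residues plus one stop codon account for 1551 bp. The sketch writes the stop as `*`, whereas this record marks it with a trailing `-`.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 18 and last 9 bases of DPOGS206043-TA, copied from the record.
head = "ATGACGATGGACCAGCAG"
tail = "AACCGCTAA"

print(translate(head))  # MTMDQQ
print(translate(tail))  # NR*

# Length check using the values stated in the record.
transcript_bp = 1551
protein_aa = 516
print(transcript_bp == 3 * (protein_aa + 1))  # True
```

Translating the full DPOGS206043-TA sequence the same way and comparing it with DPOGS206043-PA, after replacing `*` with `-`, would extend the check to every codon.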