Monarch geneset OGS2.0

DPOGS214673
TranscriptDPOGS214673-TA1446 bp
ProteinDPOGS214673-PA481 aa
Genomic positionDPSCF300321 + 89256-96407
RNAseq coverage811x (Rank: top 16%)
Annotation
HeliconiusHMEL0104723e-11297.46% 
BombyxBGIBMGA001947-TA1e-5995.54% 
DrosophilaHnf4-PB3e-14674.16% 
EBI UniRef50UniRef50_P498668e-14474.16%Transcription factor HNF-4 homolog n=82 Tax=Metazoa RepID=HNF4_DROME
NCBI RefSeqNP_001037474.10.095.00%hepatocyte nuclear factor 4 isoform a [Bombyx mori]
NCBI nr blastpgi|900253570.082.30%SXR-like nuclear receptor [Lymantria dispar]
NCBI nr blastxgi|900253570.085.58%SXR-like nuclear receptor [Lymantria dispar]
Group
Gene OntologyGO:00037072.6e-68steroid hormone receptor activity
GO:00056342.6e-68nucleus
GO:00063552.6e-68regulation of transcription, DNA-dependent
GO:00434012.6e-68steroid hormone mediated signaling pathway
GO:00037002.6e-68sequence-specific DNA binding transcription factor activity
GO:00082701.9e-38zinc ion binding
GO:00435651.9e-38sequence-specific DNA binding
GO:00036777.2e-26DNA binding
GO:00048791.9e-05ligand-dependent nuclear receptor activity
GO:00054961.9e-05steroid binding
KEGG pathway 
InterPro domain[147-402] IPR0089462.6e-68Nuclear hormone receptor, ligand-binding
[214-376] IPR0005363.2e-47Nuclear hormone receptor, ligand-binding, core
[78-149] IPR0016281.9e-38Zinc finger, nuclear hormone receptor-type
[77-146] IPR0130885e-30Zinc finger, NHR/GATA-type
[142-152] IPR0017237.2e-26Steroid hormone receptor
Orthology groupMCL12590 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214673-TA
ATGCCAGTGCTCGGCGTCGGCATGGCTCAGGAAATCGGACTGCGCTACTGTCCCACTACTGATTACATTCTACCGGGGGGGTACTGGGAGAAGAACTCCGTCCAGTATAACATGACCTACCACAGGCAGCACGACGATGCACAGTGTAACAACACAGTGTCATACAACACAGATAGTGATATGCAGCTTGAAACGAGCAGTAGTGAGGCGAGTGCTAGCTCGACCGTGCTGTCCCAACACTGCGCTATATGTGGAGACCGAGCCACCGGCAAGCACTATGGAGCGTCCTCGTGCGACGGATGCAAGGGGTTCTTCAGACGTAGCGTCAGAAAAAACCATCTCTATACATGCAGGTTCAGCAGGAATTGTGTAGTTGACAAGGACAAACGAAATCAGTGCAGATATTGCAGACTAAGGAAGTGCTTTAAGGCCGGCATGAAGAAAGAGGCGGTCCAGAACGAACGTGATCGTATTAACTGCAGACGGCCGTCTTACGAGGAGCCGGCTCAGGCGAACGGACTGTCAGTCGTGTCGCTGTTGAACGCTGAACTACTCAGTAGGAAAGTCATTGACGAGACAAACAACGTAACAGACGCCGAGATAAACAACCGGAAGTTGGCTAAGATCAATGACGTGTGTGACTCCATCAAACAGCAACTACTCATTCTGGTGGAGTGGGCCAAGTACATACCCGCCTTCACGGAGCTGCACTTGGACGATCAGGTGGCGCTGCTGCGGGCCCACGCTGGCGAACACCTGCTGCTGGGTTGTGCTCGTCGGTCGCTCCACCTGCGAGACGTGCTGCTCCTGGGAAACAACTGCATCATCACCAAACACCATCTCGACGGCAGAATGGATATAGACATCAGCATGATCGGCATGAGGGTGATGGATGAGATCGTCAAACCGCTCCGGGAGATCGACATCGACGACACGGAGTTCGCCTGCCTTAAGGCCATCGTCTTCTTCGATCCGAACGCCAAGGGTCTCTCTCAACCGCAGAAGATCAAGCAACTCCGTTACCAGATCCAAATCAACCTGGAGGACTACATCAGCGACCGTCAATACGACGGGCGCGGGCGGTTCGGCGAACTGCTGCTGTGTCTGCCGCCGCTGCAGAGCATCACCTGGCAGATGATCGAGCAGATACAGTTCGCCAAACTGTTCGGAGTCGCGCACATCGACAGCCTGCTGCAGGAGATGCTGTTGGGAGGAGCATCAACAGAAGCGACGCTCGACGAGAGTTCAGCGGGCGGGGAGGGGACCGCGGGGGTCGGGGGCGACTCGGCGGCCGCTGGGGTCGCGGGTGGACACGCCTCGCCACCACTCGTGCCCCAACTGCCTCCCGGTGAACACGTGTTTGACGCGACCTTCAAACAGGAGCCCAACATGAGTCCAGAACATACAGCCCGAGTACTGAAGACCTCGGATATAACACTGTTATAG

Protein sequence:

>DPOGS214673-PA
MPVLGVGMAQEIGLRYCPTTDYILPGGYWEKNSVQYNMTYHRQHDDAQCNNTVSYNTDSDMQLETSSSEASASSTVLSQHCAICGDRATGKHYGASSCDGCKGFFRRSVRKNHLYTCRFSRNCVVDKDKRNQCRYCRLRKCFKAGMKKEAVQNERDRINCRRPSYEEPAQANGLSVVSLLNAELLSRKVIDETNNVTDAEINNRKLAKINDVCDSIKQQLLILVEWAKYIPAFTELHLDDQVALLRAHAGEHLLLGCARRSLHLRDVLLLGNNCIITKHHLDGRMDIDISMIGMRVMDEIVKPLREIDIDDTEFACLKAIVFFDPNAKGLSQPQKIKQLRYQIQINLEDYISDRQYDGRGRFGELLLCLPPLQSITWQMIEQIQFAKLFGVAHIDSLLQEMLLGGASTEATLDESSAGGEGTAGVGGDSAAAGVAGGHASPPLVPQLPPGEHVFDATFKQEPNMSPEHTARVLKTSDITLL-