Monarch geneset OGS2.0

DPOGS203369
Transcript        DPOGS203369-TA  1359 bp
Protein           DPOGS203369-PA  452 aa
Genomic position  DPSCF300003 + 232400-250217
RNAseq coverage   683x (Rank: top 19%)
Annotation
Heliconius      HMEL017935        1e-92  56.95%
Bombyx          BGIBMGA003895-TA  2e-71  46.76%
Drosophila      Eip75B-PB         5e-52  32.23%
EBI UniRef50    UniRef50_E5SAL8   2e-93  47.21%  Nuclear hormone receptor E75 n=2 Tax=Nematoda RepID=E5SAL8_TRISP
NCBI RefSeq     XP_001952732.1    2e-96  43.05%  PREDICTED: similar to ecdysone-induced protein 78C [Acyrthosiphon pisum]
NCBI nr blastp  gi|156556061      2e-97  45.86%  ecdysone-induced protein 78C [Aedes aegypti]
NCBI nr blastx  gi|156556061      4e-92  46.34%  ecdysone-induced protein 78C [Aedes aegypti]
Group
Gene Ontology  GO:0003707  6.2e-51  steroid hormone receptor activity
               GO:0005634  6.2e-51  nucleus
               GO:0006355  6.2e-51  regulation of transcription, DNA-dependent
               GO:0043401  6.2e-51  steroid hormone mediated signaling pathway
               GO:0003700  6.2e-51  sequence-specific DNA binding transcription factor activity
               GO:0008270  6.8e-39  zinc ion binding
               GO:0043565  6.8e-39  sequence-specific DNA binding
               GO:0003677  2.6e-17  DNA binding
KEGG pathway 
InterPro domain  [226-427]  IPR008946  6.2e-51  Nuclear hormone receptor, ligand-binding
                 [52-123]   IPR001628  6.8e-39  Zinc finger, nuclear hormone receptor-type
                 [53-118]   IPR013088  6.7e-31  Zinc finger, NHR/GATA-type
                 [116-126]  IPR001723  2.6e-17  Steroid hormone receptor
                 [244-384]  IPR000536  8.1e-16  Nuclear hormone receptor, ligand-binding, core
Orthology group  MCL14744  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203369-TA
ATGGACGTGTGGGGCGGCATACCCGCTTCATGCGTTCACGCACCAACTCTGAACATTGAAAACAGTGAATTCGCTCTCACTTTCATTGACAAAGAGACCACAACTTCACAGCAGAAACAAGAGGCGGATAGCGCAGGCGCAGGCGGTAAGGCTGCCGCGCCGTGCAAGGTGTGCGGCGACAAAGCCTCCGGCTACCACTACGGCGTCACATCCTGCGAAGGTTGCAAGGGGTTCTTCAGACGCAGCATCCAGAAACAGATCGAGTACCGCTGTCTGCGGGACGGGAAGTGCCTGGTCATAAGACTCAACAGGAACCGCTGTCAGTTTTGCAGATTCAAGAAGTGCTTGGCTGTAGGCATGAGTCGCGATTCCGTCAGGTACGGCCGGGTGCCGAAGCGGCCACGGGAGGTCGTCAGCTCGGAGAGCATGCCGGACTTGACCAAAAACCTGGGGGTCGGAACCCCAACACCTACCTCCTCGGTAGAGGAAATGGACACGGACTCCCTCCAACTAGAGCTGGCCAGGGACCTCGCTAAAGTGGTGATCACCGCTCATCGCAACAACGACGCGTATTCCGAGGACTACGTCCGCAGCATGCACCTGAAAGCTATCATTGTCAAGACGGAAGACGCTGATAACGACGGAAACGGAGGGGAGGAGGCAGCATGCTCGGGCGCAGTGCGTGCTAATAAACTAACAGCTCTTTGGTACAATGTGGCGGTGCGCATGACGCCCACCGTACAGCAAGTGGTGGAATTCGCCAAGCGTGTGCCCGGATTCAACGTGCTGCCACAGGACGACCAGCTTATACTTATCAAGCTCGGGTTCTTCGAGGTGTGGCTGTCTCGTATGGGCCGGATCTCCTCGGAGGCTACCATCTTATTCGATGACGGGAACTCCATCGACCAAACGCAGCTCGAGCTTATGTATGATATCCCGTTCGCGAAGTCGATGCTGGCGTACACAGCTCGCATTAACAAGATGTGCATCACCGAGGACGAGATGGCGCTCTTCTCCGCCACCCTGCTGCTGTCCCCTCACCGGAACGGGCTGTCTGATAAGGACAAAATAACCGCTCTCCACATGTCCCTAATGGAGGCCTTCCAATTCGTGGTGACCGAGTCCGGAATATCTGATGTGCCGGCCCGCATGGAGGCATTCGCGGTGGCAACTCGCGAAGCCCGTGTCTTGGGTCTGCAGCATAACGACCAGCTGACTTGGTGCCGGCTCAACTGGCAACGGCTGGTGCTGCCCGCTCTCTTCTCTGAGATATTCGACATCCCGAAAGGGGAGGAAGAGGATCCGGAGGCCAGTGCTCTGCCGTCACCTCCCTCATGCTCCGTGCCCCAACAGGGATAA

Protein sequence:

>DPOGS203369-PA
MDVWGGIPASCVHAPTLNIENSEFALTFIDKETTTSQQKQEADSAGAGGKAAAPCKVCGDKASGYHYGVTSCEGCKGFFRRSIQKQIEYRCLRDGKCLVIRLNRNRCQFCRFKKCLAVGMSRDSVRYGRVPKRPREVVSSESMPDLTKNLGVGTPTPTSSVEEMDTDSLQLELARDLAKVVITAHRNNDAYSEDYVRSMHLKAIIVKTEDADNDGNGGEEAACSGAVRANKLTALWYNVAVRMTPTVQQVVEFAKRVPGFNVLPQDDQLILIKLGFFEVWLSRMGRISSEATILFDDGNSIDQTQLELMYDIPFAKSMLAYTARINKMCITEDEMALFSATLLLSPHRNGLSDKDKITALHMSLMEAFQFVVTESGISDVPARMEAFAVATREARVLGLQHNDQLTWCRLNWQRLVLPALFSEIFDIPKGEEEDPEASALPSPPSCSVPQQG-
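
The transcript and protein entries above can be cross-checked with a short script. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene and that the trailing "-" in the protein sequence marks the stop codon. The function names `translate` and `lengths_consistent` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol, matching the trailing "-"
    seen in the protein record (an assumption about its meaning).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript holds exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First six and last four codons of DPOGS203369-TA, copied from the record.
print(translate("ATGGACGTGTGGGGCGGC"))  # start of the protein
print(translate("CAACAGGGATAA"))        # end of the protein, including the stop
print(lengths_consistent(1359, 452))    # lengths reported in the record
```

Passing the full DPOGS203369-TA sequence to `translate` should reproduce the DPOGS203369-PA entry if both assumptions hold.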