Monarch geneset OGS2.0

DPOGS203676
TranscriptDPOGS203676-TA5535 bp
ProteinDPOGS203676-PA1844 aa
Genomic positionDPSCF300010 - 2229354-2247958
RNAseq coverage479x (Rank: top 26%)
Annotation
HeliconiusHMEL0133293e-13256.42% 
BombyxBGIBMGA003473-TA0.060.84% 
Drosophilal(3)76BDm-PA0.039.81% 
EBI UniRef50UniRef50_E2AHV20.040.34%Protein TRS85-like protein n=1 Tax=Camponotus floridanus RepID=E2AHV2_CAMFO
NCBI RefSeqXP_970422.20.060.51%PREDICTED: similar to arylhydrocarbon receptor nuclear translocator homolog b [Tribolium castaneum]
NCBI nr blastpgi|1892336190.060.51%PREDICTED: similar to arylhydrocarbon receptor nuclear translocator homolog b [Tribolium castaneum]
NCBI nr blastxgi|1892336190.060.51%PREDICTED: similar to arylhydrocarbon receptor nuclear translocator homolog b [Tribolium castaneum]
Group
Gene OntologyGO:00056347.1e-82nucleus
GO:00063557.1e-82regulation of transcription, DNA-dependent
GO:00037007.1e-82sequence-specific DNA binding transcription factor activity
GO:00055152e-18protein binding
GO:00071656.3e-10signal transduction
GO:00048716.3e-10signal transducer activity
KEGG pathwaytca:6589880.0 
 K09097 (HIF1B, ARNT)maps-> Pathways in cancer
    Renal cell carcinoma
InterPro domain[67-82] IPR0010677.1e-82Nuclear translocator
[53-109] IPR0115981e-19Helix-loop-helix DNA-binding
[333-419] IPR0136552e-18PAS fold-3
[58-111] IPR0010921.2e-13Helix-loop-helix DNA-binding domain
[129-231] IPR0137671.1e-10PAS fold
[126-193] IPR0000146.3e-10PAS
Orthology groupMCL12057 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203676-TA
ATGTCTGCCGTGGCTCCCACTATTCCCGGGGCTGACCCAACAAAGGACATACAAAAGCGTCGAGCTGGTAGTATTGGATCAGATGAAGACGACGCAAGTGGTGGGAAATATACAAGGATGGAGGAGGACAATATTCAAGACAAGGAGAGGTTTGCCAGTCGTGAAAACCACTGCGAGATCGAACGTCGTCGCAGGAACAAGATGACGGCGTATATTACGGAACTATCTGACATGGTTCCAACATGCTCCGCTCTCGCGAGGAAACCAGACAAACTAACCATACTGCGTATGGCAGTAGCGCATATGAAAGCTTTAAGAGGTACCGGCAACACGTCTACAGACGGCACATACAAACCATCGTTTCTAACGGATCAGGAGTTGAAACACCTCATACTGGAAGCGGCCGATGGATTTTTATTCGTCGTCAGTTGTGACACTGGCCGCATTATATACGTTAGTGATAGTATAGCGCCAGTGCTAAATTATTCTCAGGGTGAATGGTACTCATCATGTTTCTACGACCAAGTACATCCCGACGATTTGGAAAAAGTTCGGGAACAGCTAAGCACACAGGAGCCTCAGAACACGGGACGTATTCTGGATCTCAAAACGGGAACCGTTAAGAAGGAGGGACACCAATCTTCAATGCGTCAAGTGATGGGTTCTCGTCGCGGGTTCATATGCCGCATGCGTGTTGGCGGGACGGCGGAGAGCGCTCACCTGGGCAGGCTGCGCGCGCGCAACTCGCTCGGCCCCTCACACGACGGACACAACTACGCGGTCGTACACTGCACCGGTTACATCAAGAACTGGCCGCCGACAGATCTGTTTCCAGGCATGCAGATGGACCGACCGGTCGAGGACGAGTTGCACGCCTCTCATTGTTGCCTCGTCGCTATTGGTAGATTACAGGTGACGTCAACTCCGAGTAGCGCTGAGGGCAGCGCGTGCGGTGGCGTGGAGTTCGTGTCGCGTCACTCTGTGGAGGGCCGCTTCACGTTCGCTGACCAGCGTGCGGCTCAAGTACTAGGGTATGCGCCCGCGGACCTCCTCGGGAAACTCTGCTACGACTTTTACCATCCGGAAGACCAGCAGCACATGAGAGATAACTTTGATCAAGTTCTTAAGTTGAAAGGGCAGATAATTTCGCTCATGTATCGGTTTCGCACAAAGAACAGAGAGTGGATTTGGTTGAGGACATCCGCCTTTGCATTCCTCAATCCTTACAACGATGACGTGGAATATATTGTGTGCACAAATACATTGGCCAACCGTTCATTGGGCAGCACGGGAGGCGAGCCGGTTGCAGATGAAAACTACGATTACCACCTCCGACAGCGTGATGTGTACCAAGCACCTCCTCCGCCTATACATCAGCAACACCATGCTCCACCAGGTGGTGGAGTGGGTGCTCGTTCGCCGGGTGAGGCGGGTGGTGCGGGCGCTGCGGCTGCCTACGCTCCACATGCGCCTCACTATGCGCCTGACTACTCACCTCACCGACCCGCCAACACACCGCCACATACTACTTGGACAACGCTGCGACCGAGCGGGGCCAGTGGAGCCGGGAGTGGTGGTGAAAGTTACGCGTACAGCGGCGACGCGGCTGGAGCGGGTGCGGGTAACGGCAGCCCGGCCCGATCACCGCCCGCGCCCGCCTACCTACCACCAGCACACTACCACCACAACCACCACCCCCCACACCCTACACATCCAACGCACCCTACACATCCCACGCATCCACCGCATGCGGGTATATGGGCATGGCAGGGCGGTGCTGGGGGTGCAGGGGGCCCAGCTGCTGGCGCTCCTGAGGGAGGACACGCGCCCCACGAGCTCTCCGAAATGCTGCAAATACTGGACCAAGGCGGTGCCGCTACCTTCGAAGACCTCAACATAAACATTATAGTCGCAATGACTAAAACAGCGCAAGAATTTATACAAAACTCATTCTCTCCAATAATTGCTACGTTATGCAGTCCAAAAGTGGATAATATAGTATGTAAAAATAATCTAACTTTACCGGAGTTGCTGCAGCCGTTTACGCAACTAGACTGGGAAGGTCAAATTCGAGAACCTGGAGGAAATTTTTGTTCAATAAGAAATTTAAATTTAGTCATTAGAGATGTGCTTTGGAAGCCACTTCAGCCAACAGAGGCCCGAAGACAACTTAATAATGCAGTTTTACATAATTATGATGAGAAAACTGTTATAATGAAAATAGATACACAAATATTTGATGTACCTCAAGCAACACCGTGGTTTGATGCATGGAGGGAAACATATTTGGAAGTGCAATTCCCCTCTGACCATGAATTTACAAAACATTTCTTGGCTTGTATAATAGTTGCCACCACTTTTGAAGATGACATAGTTGAAACATACAACACTCTTAACCAACAATATGCACAACTTCAAAATGTCACCCCTCCCAAACTACCAAAGTGGTTTAACAGTTCAGTTTTGAAGTCTTATCTCTTATTACATGATGTTTCTGCAATTTCCAAAGAGAAAGCTGATAAAATGTTTGAAACAATGAAACAAACCTTTGGTGCTACCAACTGTTTTATGCTATCAATTAATTCCAAGACTTCTAACGAACCCAAGTTAACAACAGACTATTGGGCAAATTATCTTAAACAATCATCAGAAAGCAGTGTTGAAACAGTTTCACTTTCAAGCTCTTCCAGTATACATGATGCTCAACTTACAGCTCAACCAGGTCATGTCAGTTTATTAGATGATAGACAGAATACTTTAACAACATCAACAAATGCGGATGGTGTCCATCCTTTGACAAATACTGAATCAGACACATACGACAATGCAAATAGTTTACAACCATATTCGTTCTCCGGTAGTTCGGAAAGTGTAAATGTTGCTGGAAGACAATTAGAGAGACAGAGTGAGATTCATGGAGCAGCATTAAATGCCAATGACATTGAGTCAATGAAATCATTCTTAAAAGACTATGTATCTAAAGCATTTGTTCCATATTTAGAAAAGCTTATAGCACAACTAAATGAAGTGGTCGCAAATAAAAAAGGCGTCAGTAGATCTTTGCTGTCGGCGACGAAGCGTTGGTTTACAGCTGGAAAGTCTGGAACAACAACTGTGAACAATACGGTCATTTATTCGAGCGACTGTCCGGAGCTACAGCTTCGTCGTCTTGGCGACGTGTGGTTCATGTGTGGTCAATGGTCCCGTGCTTTCGATAGTTACCACGCGGCCAAGCGTGAGTTCTACGCGGATAGCGCTTGGCTTTGTTATGCCGGGGCTCTTGAAATGGCCGCGGTTAGTGCCTTCATGGCCGGCGATGCGAACCGGAAGACCTTTGGCTATATGGAGGAAAGCATTGTCACATATCTGAATACGTGCCGAATGGTGCAATATGCTGTTCGAGCTACTCTCCTCTCCGTGCCGTGTCTCATCAGCGCTGGGTTCTATGGCGAGGCCGCTAAACAACTTATCCGAATGACTTCAGAAGACAGCGATTTGCGGAGTGCTATGCTTCTAGAACAGGCGGCACTTTGTTTCTTGAAAGGGCCCTCTACCAAAATAATGTCGAGAAAATACGCTTTCCACATGGTTTTAGCCGGACACAGGTTCTCCAAAGCCGGACAAAAAAAACACGCGTACCGCTGCTACAAGCGGGCGTATCAGGTGTATGAGGATAGTGGCTGGCGGCTGTCAACGGACCACGTGCAATTTGCTCTCGGTCGTCTAGCTGGCGCGTTGAGGGTCAAGGAGGCTGTGTCGTGGCTCGCAGCTCCTCTCGCACCAAACTCCCCTCAGCCGCCCGCCATGCAGGACGCCTTCCTCCGAGAGTTCATGCTGGCACACCAGCAATTCGTTGAAACTTTAGAGGAGTTTAAGGAACACCTTCCAGTGTTGCCAGTGCCGTTGTTATCAGTAGAGGACACCGCGGTGTTATGCGTTGGTCCGATGCCTTTGTCGTCTCCGGGACGTATCGCCGCCTCTTCACTTTCTTTGCCGCCGCAGAGGAATTCCTCAAAAGATTTCCCATTCTGGCACAAATTGGAGGAGAATCTGCTTCAGGTGGCGCAAGGAAGCGTCCCTATGATATTTAAGCCAAGTATAGATTTATATACGCAGAAAACAGATAGCAACCCCATTGTACCCAAAGGAGAACCAATTCAGATTGCCATAACTTTATATAATCCTTTGAAAATTCCGATACTCTTGAAGGAGTTGGAATTGTTGTGGCAGTTCACTTTGGAAGCGGATAATACAGAGATTTCTAGCGATGAGATTTTAAACAACGAGCCGTTAATAGCTTCGGGACAAATTAAAGAGAGTAATGTTATACGTGGACAGAAATTGAAATCGTTCTTACTGGAAGGAGAGTGCAGAAAGACGCTAAATCTAACGGTCACACCCTTACAGACTGGACAGCTGTCCATACAGGGATTAGCGTTCAATCTGATAAACGTCGGAGAGGGAAAGAATAATGAAAATGGCGTTTCAGTGTTGGGTAAAGTAAACTTACAGAATGGTGCTAATTGTTCTGATAAATTGTTACTAATAACAGTCATACCACACGCGCCGTGTCTTCAGATGACACTATCGGAAACTGTGAGTGAAGTAATAAGTGGAGAAATATTAACAGTGGACGTGGATTTCTGTAACATTGGTCCGGTAACATTGAAAAACCTGTATTTAGCAGTCTCACACCCGGAGTGTATGGCGTGGCGTGGCGTGGTGGGCTCTAGTGATAACGTAGACGACTTTGGCGTTCTCTATGACGAAAAATATAGACCGCCCCCCGACTTTACAGAACATCACCCAGAAACCAAAGATGACATTAATAAGCCGCCCTATTCTAAATTTGTAACTGACTATATGTCTCCATTACTAGAAAATTTGGAATCTATTCAAAGTCCAGCGAATACTTCTGGCCTGATCCAATCCATGTTGGTTTTGAGATGGAAAGCAAACAATAGAAAGACTAATCGCAGAGTCGTCGGCCAACACAGCATCTGGATGGATTGCTTTACGAAAACACTATCAAGGGAAAGAGAAAAATTGCCCATCGAAGTCACAAGCGGAGTACAACTCGACGATTTAGACAGCGCCACAGATATAACCGATGTGAAGAGTAAAAATGATAATAACGACCTCGTCATCATCAAAGTCGAACATTCCAATCACATTAATCATAACTTCAAAGCAAATAAACTATGCCTGATACCCGTAGTCTTGAACATAGTCAATTGTCAAGGATCACCGGTAACAGTTTTTATTGATATGCATAAACAACAGAACAGAGATTCTTCGGGAGAGATTGGATGGGCCGGGGCCCTAAATAACGGTTTAGATGCGGATTCCAAAGAATTGGGTGTGAACGTGACTTTGGATAAGTTCGAGTCGAGAAGAGTTCAGGTTCGGGCTTTGTGTGCCGCTCCAGGAACATATCTGGTGGGTGGTGCGTTCCACGTCACTACCAAACACGACCAAATCCTCAACTCATACTTTCCTAATACAACTTCACTGTTGGTCGTTAAGCAGATGTAA

Protein sequence:

>DPOGS203676-PA
MSAVAPTIPGADPTKDIQKRRAGSIGSDEDDASGGKYTRMEEDNIQDKERFASRENHCEIERRRRNKMTAYITELSDMVPTCSALARKPDKLTILRMAVAHMKALRGTGNTSTDGTYKPSFLTDQELKHLILEAADGFLFVVSCDTGRIIYVSDSIAPVLNYSQGEWYSSCFYDQVHPDDLEKVREQLSTQEPQNTGRILDLKTGTVKKEGHQSSMRQVMGSRRGFICRMRVGGTAESAHLGRLRARNSLGPSHDGHNYAVVHCTGYIKNWPPTDLFPGMQMDRPVEDELHASHCCLVAIGRLQVTSTPSSAEGSACGGVEFVSRHSVEGRFTFADQRAAQVLGYAPADLLGKLCYDFYHPEDQQHMRDNFDQVLKLKGQIISLMYRFRTKNREWIWLRTSAFAFLNPYNDDVEYIVCTNTLANRSLGSTGGEPVADENYDYHLRQRDVYQAPPPPIHQQHHAPPGGGVGARSPGEAGGAGAAAAYAPHAPHYAPDYSPHRPANTPPHTTWTTLRPSGASGAGSGGESYAYSGDAAGAGAGNGSPARSPPAPAYLPPAHYHHNHHPPHPTHPTHPTHPTHPPHAGIWAWQGGAGGAGGPAAGAPEGGHAPHELSEMLQILDQGGAATFEDLNINIIVAMTKTAQEFIQNSFSPIIATLCSPKVDNIVCKNNLTLPELLQPFTQLDWEGQIREPGGNFCSIRNLNLVIRDVLWKPLQPTEARRQLNNAVLHNYDEKTVIMKIDTQIFDVPQATPWFDAWRETYLEVQFPSDHEFTKHFLACIIVATTFEDDIVETYNTLNQQYAQLQNVTPPKLPKWFNSSVLKSYLLLHDVSAISKEKADKMFETMKQTFGATNCFMLSINSKTSNEPKLTTDYWANYLKQSSESSVETVSLSSSSSIHDAQLTAQPGHVSLLDDRQNTLTTSTNADGVHPLTNTESDTYDNANSLQPYSFSGSSESVNVAGRQLERQSEIHGAALNANDIESMKSFLKDYVSKAFVPYLEKLIAQLNEVVANKKGVSRSLLSATKRWFTAGKSGTTTVNNTVIYSSDCPELQLRRLGDVWFMCGQWSRAFDSYHAAKREFYADSAWLCYAGALEMAAVSAFMAGDANRKTFGYMEESIVTYLNTCRMVQYAVRATLLSVPCLISAGFYGEAAKQLIRMTSEDSDLRSAMLLEQAALCFLKGPSTKIMSRKYAFHMVLAGHRFSKAGQKKHAYRCYKRAYQVYEDSGWRLSTDHVQFALGRLAGALRVKEAVSWLAAPLAPNSPQPPAMQDAFLREFMLAHQQFVETLEEFKEHLPVLPVPLLSVEDTAVLCVGPMPLSSPGRIAASSLSLPPQRNSSKDFPFWHKLEENLLQVAQGSVPMIFKPSIDLYTQKTDSNPIVPKGEPIQIAITLYNPLKIPILLKELELLWQFTLEADNTEISSDEILNNEPLIASGQIKESNVIRGQKLKSFLLEGECRKTLNLTVTPLQTGQLSIQGLAFNLINVGEGKNNENGVSVLGKVNLQNGANCSDKLLLITVIPHAPCLQMTLSETVSEVISGEILTVDVDFCNIGPVTLKNLYLAVSHPECMAWRGVVGSSDNVDDFGVLYDEKYRPPPDFTEHHPETKDDINKPPYSKFVTDYMSPLLENLESIQSPANTSGLIQSMLVLRWKANNRKTNRRVVGQHSIWMDCFTKTLSREREKLPIEVTSGVQLDDLDSATDITDVKSKNDNNDLVIIKVEHSNHINHNFKANKLCLIPVVLNIVNCQGSPVTVFIDMHKQQNRDSSGEIGWAGALNNGLDADSKELGVNVTLDKFESRRVQVRALCAAPGTYLVGGAFHVTTKHDQILNSYFPNTTSLLVVKQM-