Monarch geneset OGS2.0

DPOGS200190
TranscriptDPOGS200190-TA2469 bp
ProteinDPOGS200190-PA822 aa
Genomic positionDPSCF300360 + 44533-80358
RNAseq coverage1681x (Rank: top 8%)
Annotation
HeliconiusHMEL0098927e-14278.51% 
BombyxBGIBMGA005097-TA3e-12877.32% 
Drosophilasima-PA2e-8741.37% 
EBI UniRef50UniRef50_D6WMG71e-13247.08%Hypoxia inducible factor 1, alpha subunit n=3 Tax=Pancrustacea RepID=D6WMG7_TRICA
NCBI RefSeqXP_967427.21e-13347.08%PREDICTED: similar to hypoxia-inducible factor 1 alpha [Tribolium castaneum]
NCBI nr blastpgi|1892376692e-13247.08%PREDICTED: similar to hypoxia-inducible factor 1 alpha [Tribolium castaneum]
NCBI nr blastxgi|1892376695e-12842.24%PREDICTED: similar to hypoxia-inducible factor 1 alpha [Tribolium castaneum]
Group
Gene OntologyGO:00055155.3e-14protein binding
GO:00071651.1e-10signal transduction
GO:00048711.1e-10signal transducer activity
GO:00063558.6e-10regulation of transcription, DNA-dependent
GO:00056346.4e-05nucleus
GO:00037006.4e-05sequence-specific DNA binding transcription factor activity
KEGG pathwaytca:6557723e-133 
 K08268 (HIF1A)maps-> Pathways in cancer
    mTOR signaling pathway
    Renal cell carcinoma
InterPro domain[219-303] IPR0136555.3e-14PAS fold-3
[62-128] IPR0000141.1e-10PAS
[69-128] IPR0137678.6e-10PAS fold
Orthology groupMCL10512 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200190-TA
ATGCAAATCTTCGCGGAGCTGACAGCATCACTACCAGCTAAGAAGGAAGACGTCGAACAGTTGGACAAAGCTTCCATAATGAGACTCGCCATATCATATCTACGTGTGCGTGATGTTGTGTCTATGTTGCCAGAAGATAAGGAGGTTCCGTTAATGCAGAGTCCCAAAGGCCTGGAGGAGATCCAATCAGAGCTTTCATATATGAAAGCGTTGGATGGATTTGTTCTGGTCCTATCCCAACAGGGCGACATCGTCTACTGCAGCGAGAATGTTGCTGAGCATCTTGGAGTTTCACAGATGGAGGTCATGGGTCAGAGTGTGTTCGAGTTCAGTCACCCCTGCGACCACGAGGAGGTGAGGGAGGCTCTGAGGTCAAGTAAAGATGGCAAGAGAGACCTATTGTTGCGTCTCAAGTGCACTCTCACCAGTAAAGGGAGGAACGTTCATCTCAAATCTGCTTCTTATAAGGTGATCCACGTAACTGGACACATGCTGACAGAAGAAAATCAAACTGATGGAGATAAAGATACAAAGAAAATCGGCAAATCCGCGCTCGTAGCTGTGGGAAGACCGATACCTCATCCATCCAACATAGAAACTCCTTTGAACAATATGACTTTCCTCACAAAGCACAGCTTGGACATGAAATTCACTTATAGTGATGAAGGTCTTCAAAATGCCCTTGGATATGATTCCAATGATCTGGTCGGTCATACCCTGTATGACTACCATCATGCAGGCGACAGCGCTGTACTGCTGCAGCAATTTAAATCATTGTTCTCTAAGGGACAATGCGAGACCGGACAATACCGATTCCTCGCCAAGAAGGGCGGCTACGCCTGGGTCCAAACCCAGGCCACCGTCATCACAGACAAGCAACAAAAGCCGATTAGCGTCATCTGCGTCAACTACGTCATCAGCACAGGTGATGTAATTCAAGATTGTCGGACGTTGCCTCGGTCACGTACAGACTCGGAGCAAGTTGAAAACGAGGGTGCTGTGAGCGGTATCGAGTGCAAAGATGAGGTGTTCGCTGCACACCAAGTGCAGCACGCAGACTTGAAGCCTGCCGTGGCTCCGACTATACCAGAAGCTAATCCAGCACAGATTTGTGTGGCCACTGAACCATCTAACGGTGCTATAGTGGCCGCTGTCCTTCCAGAAGAAGAGCGCCCTGTACCCGTTACTGAACTTATATTTGCACCCAGAAAGAAGGAAATGAATAAAGGATTTCTCATGTTCTCTCCTGACGAGGGACTTACAATGCTTAAAGACGAGCCGGAGGACCTCACACATTTGGCGCCGACGGCTGGAGATGCTTGCATTCCTTTAGAGAACAGCCCTTTTGACATGTTTGACGAATTCATTTTAAGTGACAATTATTGCAGTCTACTTGGTGATGATCTGACTAGTGGATCACCAGTAGATTCGTTGATAGCGGATTCCTTACTCTCGTCCCCAGAGCCACAGGAAACCGAATCATCGTGCGAACAGTCGTCGCTTCTGAACGAGTTGTCTTTGGATGCGTTTGATAGTAACAAGTCTGAGAATGACATCGACGATGGACATTCACCATTCATTCCCACTACTGACGAGCTTCCCCTTCTGGAGCCAGCAGTTATGTGGGGGGCTCTGCCTGACAACGTGTGCCAGGCTAGACCTCAACCGACCGAAGTTCAAAGCCCCGCACCAGCGTTGCAGCGCTTACTAGCAGCGCCACCGACCGGGCCACGACCGCAAGATCTCATCACAAATATATATTCAGATCAAGGTCTTATTCCAAGCAAGATATCAACATGGGACACTGGTGTTAAGCGTGTTTTGACCAAAGACGAAGAGCCTTCAGCGAAACGCGTGAAGCGTAGTCCGTCACCGACAACGAATCAGACCTCAAGCGTCCTTATGAACCTCCTGGATGTCCATAATAATGGAAAGACGACGAATCAGCTGCCAAAATATCAGATGCTAGTCACTCCACCGACAGCCTCGCCTCAAAGCCCAATCCGAAACATCCCGGTGCCGGTCATAAACATGATGCAGCCCAATCAGCAACTCCGAAACACGATAACCAATCAAATAAGCACAAACATATCAAATCCAATGTCCCCACTGACTTTAAACGTCGGAAGTCCATTATACTCCCTGCCATCAAGTCCAGCTATAAGTCCAATCCAAAGAGATCGCGTTCTGAGCCCCTACTCCACTCCCCAATCTTTATCACCCGCCGGTAGTTATCAAATGTACAGCCCGAATAGCAACATTCTCCTATCACCCTCCGGAGTAATGCAAGGTTATGATCCGTATTTGACGAATAAAATGCAAACTAGCCCCGGATATCCTCTCCAGACGTCAGATATGCTGATGGATTCCAACATCCAGCTGCAATCTGCCGACTTCTGGTCTGATTCTGAGATGCTGCAAGGCACGAGCGATCTCCTCACAGCATTCGACGACGTCAAATTGGTGTAA

Protein sequence:

>DPOGS200190-PA
MQIFAELTASLPAKKEDVEQLDKASIMRLAISYLRVRDVVSMLPEDKEVPLMQSPKGLEEIQSELSYMKALDGFVLVLSQQGDIVYCSENVAEHLGVSQMEVMGQSVFEFSHPCDHEEVREALRSSKDGKRDLLLRLKCTLTSKGRNVHLKSASYKVIHVTGHMLTEENQTDGDKDTKKIGKSALVAVGRPIPHPSNIETPLNNMTFLTKHSLDMKFTYSDEGLQNALGYDSNDLVGHTLYDYHHAGDSAVLLQQFKSLFSKGQCETGQYRFLAKKGGYAWVQTQATVITDKQQKPISVICVNYVISTGDVIQDCRTLPRSRTDSEQVENEGAVSGIECKDEVFAAHQVQHADLKPAVAPTIPEANPAQICVATEPSNGAIVAAVLPEEERPVPVTELIFAPRKKEMNKGFLMFSPDEGLTMLKDEPEDLTHLAPTAGDACIPLENSPFDMFDEFILSDNYCSLLGDDLTSGSPVDSLIADSLLSSPEPQETESSCEQSSLLNELSLDAFDSNKSENDIDDGHSPFIPTTDELPLLEPAVMWGALPDNVCQARPQPTEVQSPAPALQRLLAAPPTGPRPQDLITNIYSDQGLIPSKISTWDTGVKRVLTKDEEPSAKRVKRSPSPTTNQTSSVLMNLLDVHNNGKTTNQLPKYQMLVTPPTASPQSPIRNIPVPVINMMQPNQQLRNTITNQISTNISNPMSPLTLNVGSPLYSLPSSPAISPIQRDRVLSPYSTPQSLSPAGSYQMYSPNSNILLSPSGVMQGYDPYLTNKMQTSPGYPLQTSDMLMDSNIQLQSADFWSDSEMLQGTSDLLTAFDDVKLV-