Monarch geneset OGS2.0

DPOGS201107
TranscriptDPOGS201107-TA1110 bp
ProteinDPOGS201107-PA369 aa
Genomic positionDPSCF300137 - 392422-398198
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0053126e-11892.28% 
BombyxBGIBMGA013687-TA3e-10986.82% 
Drosophilaslbo-PA3e-2863.11% 
EBI UniRef50UniRef50_Q38Q346e-10786.82%Chorion specific C/EBP n=2 Tax=Obtectomera RepID=Q38Q34_BOMMO
NCBI RefSeqNP_001037374.11e-10786.82%chorion specific C/EBP [Bombyx mori]
NCBI nr blastpgi|1129837302e-10686.82%chorion specific C/EBP [Bombyx mori]
NCBI nr blastxgi|1129837302e-13486.94%chorion specific C/EBP [Bombyx mori]
Group
Gene OntologyGO:00063552.7e-14regulation of transcription, DNA-dependent
GO:00435652.7e-14sequence-specific DNA binding
GO:00037002.7e-14sequence-specific DNA binding transcription factor activity
GO:00469832.7e-14protein dimerization activity
KEGG pathway 
InterPro domain[278-328] IPR0117002.7e-14Basic leucine zipper
[276-340] IPR0048273.4e-11Basic-leucine zipper (bZIP) transcription factor
Orthology groupMCL16023 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201107-TA
ATGATGACTCGTTTTGAGTGGGAGCTGAAGATCGCGGGCCGCGAGCAGGCCTGGGGACCAGGGCCCGAAGACTTTTACTCGCGAAGTTCTCGAGCAGCTTCCTTCCGGTCGCTGGACGGCGGACAGGCTTGCGCGCGCGCCTCGCCAGTTAGCGGCCGGCCCTCGGAGCGAACGCACGTGACGCGGAGTGCATGTGCTGTGACACGGAGGACTGTGAAGTGGAGACCCGCACAGGAACCGCTGGGCCCCTCGGCCCCGGCGGGATGGAATCCCCGCAAATGTACGACCAGGCCGCGGCGCCTGGCCGCGGCGCCGCAGCCGCCGCCGCAACCAGACCTCAAGAAGGCCAACGAGGACAAACGGAATGCCTTCCCCCCGCCCGACCTGGACGAGCTCAATGGCCAGGAGATCAGCCTGGACCTGCAGCACCTCATCGAGGACCAGTTCCGGGGGGAGGAGACTATGGCCCTGTTCCAGGAGATACTGCCCGGGGCGAGATCCCCGCAGCAGAGATCGTTCCCGCGGACTCTGGCCTACATGCCGCAGCCCGTGCACTCGGGGGCCTCGTACGTAGCGCCCGTCCCCAACAACAACCACGAGCAGGCGCCGCCAATAAAAGAGGAGCCCCCGGAGCCCCACGACTTTAGGAGATCTGTTTCGTGTGCTCAGTATACGGGTCAATATAACCCCCAGCCTCCCGTGGGTGTGAGTGGTCCTTACGGTGGAGGATTCACGCCACTGCCTCCCCTGGGAGCACCCCTGCTGCCTCCCATGTTGAAACACAAGCCCGCTCCGGCCAGAAGGTCGTCGGGCAAGGTTCTGGACAAAGGTACCGACGAGTACAGGAGGCGAAGGGAACGAAACAACATCGCAGTACGGAAGTCACGCGAGAAGGCAAAGGTCAGGTCCCGGGAGGTGGAGGAAAAGGTGAAGACGCTGCTGAGAGAGAAAGAGGCTCTGCTGAAAAGGCTGGAGGCCGTGACGGGCGAGCTCAGCCTCCACAAACAGATGTACGTCCACCTCATAAACCTGAACCACCCTGAGATCACGGAGTTGTGCCGGTCGATGCTGCAGCTGGGTGCTCCCCATGGAAACGACCACACGCTCTGA

Protein sequence:

>DPOGS201107-PA
MMTRFEWELKIAGREQAWGPGPEDFYSRSSRAASFRSLDGGQACARASPVSGRPSERTHVTRSACAVTRRTVKWRPAQEPLGPSAPAGWNPRKCTTRPRRLAAAPQPPPQPDLKKANEDKRNAFPPPDLDELNGQEISLDLQHLIEDQFRGEETMALFQEILPGARSPQQRSFPRTLAYMPQPVHSGASYVAPVPNNNHEQAPPIKEEPPEPHDFRRSVSCAQYTGQYNPQPPVGVSGPYGGGFTPLPPLGAPLLPPMLKHKPAPARRSSGKVLDKGTDEYRRRRERNNIAVRKSREKAKVRSREVEEKVKTLLREKEALLKRLEAVTGELSLHKQMYVHLINLNHPEITELCRSMLQLGAPHGNDHTL-