Monarch geneset OGS2.0

DPOGS208226
Transcript        DPOGS208226-TA   1665 bp
Protein           DPOGS208226-PA   554 aa
Genomic position  DPSCF300079 - 562877-564820
RNAseq coverage   3921x (Rank: top 3%)
Annotation
Heliconius      HMEL008375        4e-111  52.87%
Bombyx          BGIBMGA006415-TA  6e-94   45.60%
Drosophila      (no entry)
EBI UniRef50    UniRef50_Q538A5   1e-91   45.60%  Chorion b-ZIP transcription factor n=1 Tax=Bombyx mori RepID=Q538A5_BOMMO
NCBI RefSeq     NP_001037099.1    2e-92   45.60%  chorion b-ZIP transcription factor [Bombyx mori]
NCBI nr blastp  gi|112982938      4e-91   45.60%  chorion b-ZIP transcription factor [Bombyx mori]
NCBI nr blastx  gi|112982938      7e-99   44.30%  chorion b-ZIP transcription factor [Bombyx mori]
Group
Gene Ontology
  GO:0006355  9e-08  regulation of transcription, DNA-dependent
  GO:0043565  9e-08  sequence-specific DNA binding
  GO:0003700  9e-08  sequence-specific DNA binding transcription factor activity
  GO:0046983  9e-08  protein dimerization activity
KEGG pathway
InterPro domain
  [486-534]  IPR011700  9e-08  Basic leucine zipper
Orthology group
  MCL25241  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208226-TA
ATGGTTGCTCAAAAGATCACCGTCAGTCCGCGTCAAGGAGACGATAACGGAACAAAGTTCAATTCAAACCAAAGACAAGGTTTTTTCGCAAACGAGTCTCAAGGCTCGACGTTCTCTGAAACTAATAGACATAGATCGCTTAAAGTTCCAAAAGAACCAACAGGTGAACGGTCAGCGCGGAGTAATCCCCAACGCAAGAACAGAATTGCCAACATCTACTCCAACGCCAACAACCAGCTTTTCAGAGTTAATTCGTCAATTAACATATCGTATCATCGATTTCAGGAATTTTTGAAAACGGACCTGCTATTCACCAACACAATGTTCCACGACATGGATTGCGAGTTCTTCCAAGATCTGGTGCAATTGACTTCAGCGTCCGCCGAGGAAGGAATAGTACAGTCCATTGATTCTCAAAAAATAACCGAAAAGGCAAGGGCATACCACACAGACACACAGCACACACCATTCTCACCGCAAGGTTGGGATGTTTCGGACGCTAATTCGCCGTCAGCTTCGACTCATAATCAAACGCAGAGCTATCCCGTCTCTCCCACCGACGAGGGGATGGCTACTGGCTTTAATACTGATGTATTTTACAATTCAAATGCTATCAACGAAGTTAACATACCGTTCCAAGAACAGTTCTTAGACATCAGTACACTGCCGTTGACGATAGGTGATTTAGCGCCGGATAGTGTTGCCGAAACGAATCCGTGGCAGGCCGCGGAATTCACTTGGCAAAACACTGACGACTTAGACTACACAAAACCAACATCGAACATACACACTATGCCCTTCATAGACCAAGAGGACTCCCTGGACACGAAATTCATATCTGTTATACCTAGGGAGGTTGAGAGCAATGACGTAGTTATATCAGAGTACATCATTAATGAGACGCCAGAAACTAAACCGGCTGCCGCCCACAACTATACTGTCCAAGAGAGGCGGAGTGGTGGGCTGTCGCTAGATGTCGCCAGGTATCCACACAACTGGCAAGGAGAAGTCATCAGCACACCAGAGGTTCTTAGTTTTGTCGAGCAATTGGAGAAAGAGAAATGTACACTCCCAAGCTCGCATACTTTGGCCACTGATCATACAGCCTTCACGGAGAACACGATTATTGAAGATTCCCCTCCGGCCCCTGTCGACTATGAACCTATCACACCCAAAAGTGAACCGCAAATTGATTCGGACGAGGATATAAAACCAGGACCGAGGAAGAGAAGGCGGAACGACAGCGAGGACTCGGACGTATCATACACTCCTTATTCTGTTAGCAGTTCACCTAGAAAGTATAGGAAAAGAAAGCCTAGCATACCAATCAAAGACATGATCAGAGCTCTAGAAGGCGCCCAGCAACAAACCAAAGCTAGAAGGGGCCGACCACCCAAGAGGAGGGAAAGCGACGTGTCGGCTGTAAGCGAAAACTCGTCATCCACACACGAGATAAGTTACAGGAAACTCCGGGACAAAAATAATGAAGCATCCAAGAGATCGAGAATGAACAGAAAGCTGAAAGAACTGCAAATGGAACAAATGGCCGTCGACTTGGAAGAAAGAAATAAGAGACTGCGAATAAGGGCAGAACTATTAGAAGAAGCGACGAAGAAGCTTCGAGATGCCTTCATGTTAGCGGTGTCACAGAAAAAGGCTGGTTAA

Protein sequence:

>DPOGS208226-PA
MVAQKITVSPRQGDDNGTKFNSNQRQGFFANESQGSTFSETNRHRSLKVPKEPTGERSARSNPQRKNRIANIYSNANNQLFRVNSSINISYHRFQEFLKTDLLFTNTMFHDMDCEFFQDLVQLTSASAEEGIVQSIDSQKITEKARAYHTDTQHTPFSPQGWDVSDANSPSASTHNQTQSYPVSPTDEGMATGFNTDVFYNSNAINEVNIPFQEQFLDISTLPLTIGDLAPDSVAETNPWQAAEFTWQNTDDLDYTKPTSNIHTMPFIDQEDSLDTKFISVIPREVESNDVVISEYIINETPETKPAAAHNYTVQERRSGGLSLDVARYPHNWQGEVISTPEVLSFVEQLEKEKCTLPSSHTLATDHTAFTENTIIEDSPPAPVDYEPITPKSEPQIDSDEDIKPGPRKRRRNDSEDSDVSYTPYSVSSSPRKYRKRKPSIPIKDMIRALEGAQQQTKARRGRPPKRRESDVSAVSENSSSTHEISYRKLRDKNNEASKRSRMNRKLKELQMEQMAVDLEERNKRLRIRAELLEEATKKLRDAFMLAVSQKKAG-
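
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration and not part of the database: the variable and function names are hypothetical, and it assumes that the transcript consists of coding sequence only (including the stop codon) and that the genomic coordinates are 1-based and inclusive. The record itself confirms neither assumption.

```python
# Values copied from the record; all names here are illustrative only.
transcript_bp = 1665                         # Transcript DPOGS208226-TA
protein_aa = 554                             # Protein DPOGS208226-PA
genomic_start, genomic_end = 562877, 564820  # on scaffold DPSCF300079
domain_start, domain_end = 486, 534          # InterPro IPR011700 span


def cds_length(aa_count, include_stop=True):
    """Nucleotides needed to encode aa_count residues, optionally plus a stop codon."""
    return 3 * (aa_count + (1 if include_stop else 0))


# Assumption: coordinates are 1-based and inclusive, hence the +1.
genomic_span = genomic_end - genomic_start + 1

# Genomic bases not accounted for by the transcript length.
extra_genomic_bp = genomic_span - transcript_bp

# Does the annotated domain fall inside the protein?
domain_in_protein = 1 <= domain_start <= domain_end <= protein_aa

print(cds_length(protein_aa) == transcript_bp)  # True
print(genomic_span)                             # 1944
print(extra_genomic_bp)                         # 279
print(domain_in_protein)                        # True
```

Under these assumptions the 1665 bp transcript matches 554 codons plus one stop codon, and the genomic interval is 279 bp longer than the transcript. That difference could reflect intronic sequence, but the record does not state the gene structure.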