Monarch geneset OGS2.0

DPOGS206748
Transcript: DPOGS206748-TA, 1248 bp
Protein: DPOGS206748-PA, 415 aa
Genomic position: DPSCF300316 - 26798-31199
RNAseq coverage: 452x (Rank: top 27%)
Annotation
Heliconius: HMEL011183, E-value 7e-124, identity 53.35%
Bombyx: BGIBMGA009727-TA, E-value 1e-79, identity 41.49%
Drosophila: CG31301-PA, E-value 6e-34, identity 29.10%
EBI UniRef50: UniRef50_E2BVT2, E-value 1e-47, identity 32.78%, NF-kappa-B-repressing factor n=7 Tax=Harpegnathos saltator RepID=E2BVT2_HARSA
NCBI RefSeq: XP_001600262.1, E-value 1e-43, identity 31.55%, PREDICTED: similar to GA16162-PA [Nasonia vitripennis]
NCBI nr blastp: gi|307199731, E-value 4e-47, identity 32.78%, NF-kappa-B-repressing factor [Harpegnathos saltator]
NCBI nr blastx: gi|307199731, E-value 4e-47, identity 32.78%, NF-kappa-B-repressing factor [Harpegnathos saltator]
Group
Gene Ontology: GO:0005622, E-value 1.4e-10, intracellular
Gene Ontology: GO:0003676, E-value 1.4e-10, nucleic acid binding
KEGG pathway 
InterPro domain: [1-65] IPR021859, E-value 1.8e-16, Protein of unknown function DUF3469
InterPro domain: [274-315] IPR000467, E-value 1.4e-10, D111/G-patch
Orthology group: MCL11126, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206748-TA
ATGGAACGATGGAAAAACATGTATCCTGAAGATAGGCTCGTCTGTTTGGCCAGAGTATTCATTAACATAGAGTTTATGGGCTGTCGGTATCCAAACGAAGTTATGCTAGAAGTTTCTAGATTATCGAACGAAGTTGCAGAAGAATATAGAAAAATGAAGAAGATGAAGCTACAAAGAACCTTCGTATCCGCCTCTGAGGCAGCTGCAGTTAAGGCTAAGGGTAAAAAACGAAAAGGTGGACTCGTAAAAGGGAAGCCTCCGAACAAAGCACCCAAAATAGACTTTGTACCCCAAGGACAGCAGGTGCATACAAAAATTAAAAATGAAAATAATGAGAATCAACCAAATAATGTTGATTCAACAACAACCGACAGTGAAACTGAGACCAAAATCTCGCAAACGACAAGCAAACCTATATCTATTGATTACCTAAAAGAATTATCGAAGGTAAAGTGCGTGGACGTCAGCAAGTTTGATGACACGATGTTCGAAACGCCGTTCGGGAGATTTGTTCTATTGATAAACACGAGTATGACAAAGTTGGGGAACATACAGAGTAGCTGTCAGGCTTGCAAACTGAATACCACATTTTCGTATGAGGACAATGTGTACACCATTCATATAAACGAAGACTTGATAGCCGAGGCCCCCGGAACTACTAAAGCATTGGCCAGAGAGGCTGCCGAGAAATTAGCTTGGAAAAAACTGAAAAAGCACTGCGTGTGCTTACTGGTCAGAGAAAACAAGAGCAATAAAATAGATAAATTAAAAATTAACGAAGTCTTCAGGAAGAAAGAGGATTCTGGGACAAAGGTTGAGAACAGCGTTGCTGTCAAAATGATGAAGCTCATGGGGTGGAAGGGAGGAGGTCTGGGTGTCGATGCCCAGGGTATCCAAGAACCTATACAACCACATCTACAGACGGGTAAACGATCTGGTTTGGGTTCGACGCCGGGTATGCATCACATCAGGGCGGCGGGCACGAAACTAATGAAGCGTCTCCAAGCCTCGGATGACTTTGACGTGGAACTTGTGTTCACGAACGAATTCTCAAAAGAGGAGAGGGCGGCGTTACACAAGTGCGCCCAGAACCACGGGCTCGTCTCTAAAAGCTACAACAGTAACAGTCAGAGATTTCTGGTGGTAAAGAAGAAATTGGATCCGTTCTCACTAGTGAAAGCTGCTATTGAAAAAGGCGGGGACACACCCAAATATAAAGTGTTCATACCGGCCGTGCTGGCAAAATGA

Protein sequence:

>DPOGS206748-PA
MERWKNMYPEDRLVCLARVFINIEFMGCRYPNEVMLEVSRLSNEVAEEYRKMKKMKLQRTFVSASEAAAVKAKGKKRKGGLVKGKPPNKAPKIDFVPQGQQVHTKIKNENNENQPNNVDSTTTDSETETKISQTTSKPISIDYLKELSKVKCVDVSKFDDTMFETPFGRFVLLINTSMTKLGNIQSSCQACKLNTTFSYEDNVYTIHINEDLIAEAPGTTKALAREAAEKLAWKKLKKHCVCLLVRENKSNKIDKLKINEVFRKKEDSGTKVENSVAVKMMKLMGWKGGGLGVDAQGIQEPIQPHLQTGKRSGLGSTPGMHHIRAAGTKLMKRLQASDDFDVELVFTNEFSKEERAALHKCAQNHGLVSKSYNSNSQRFLVVKKKLDPFSLVKAAIEKGGDTPKYKVFIPAVLAK-
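
The transcript and protein lengths listed above agree with each other: 1248 bp is 416 codons, and dropping the terminal stop codon leaves 415 residues. A minimal Python sketch of that check follows, assuming the standard nuclear genetic code and this record's convention of writing the stop as "-". The names `CODON_TABLE` and `translate` are illustrative and are not part of the gene set or any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are emitted as `stop_symbol`, matching the trailing "-"
    in the protein record above (an assumption about that convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 8 codons and last 3 codons of DPOGS206748-TA, copied from the record.
print(translate("ATGGAACGATGGAAAAACATGTAT"))
print(translate("GCAAAATGA"))
```

Passing the full DPOGS206748-TA sequence to `translate` and comparing the result with DPOGS206748-PA extends the same check to the whole record.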