Monarch geneset OGS2.0

DPOGS213780
TranscriptDPOGS213780-TA861 bp
ProteinDPOGS213780-PA286 aa
Genomic positionDPSCF300212 + 484128-486779
RNAseq coverage407x (Rank: top 30%)
Annotation
HeliconiusHMEL0022642e-4949.64% 
BombyxBGIBMGA009265-TA1e-11974.83% 
DrosophilaNf-YA-PA2e-2370.27% 
EBI UniRef50UniRef50_D1ZZH56e-4645.88%Putative uncharacterized protein GLEAN_07453 n=1 Tax=Tribolium castaneum RepID=D1ZZH5_TRICA
NCBI RefSeqXP_972706.21e-4645.88%PREDICTED: similar to nuclear transcription factor Y, alpha like [Tribolium castaneum]
NCBI nr blastpgi|1892364332e-4545.88%PREDICTED: similar to nuclear transcription factor Y, alpha like [Tribolium castaneum]
NCBI nr blastxgi|1892364331e-4946.18%PREDICTED: similar to nuclear transcription factor Y, alpha like [Tribolium castaneum]
Group
Gene OntologyGO:00056341.7e-32nucleus
GO:00063551.7e-32regulation of transcription, DNA-dependent
GO:00037001.7e-32sequence-specific DNA binding transcription factor activity
KEGG pathwaytca:6614583e-46 
 K08064 (NFYA)maps-> Antigen processing and presentation
InterPro domain[4-274] IPR0012891.7e-32CCAAT-binding transcription factor, subunit B
Orthology groupMCL16307 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213780-TA
ATGGAGCATGCACAGATCCAAACGTCAGATGGTACACAAGTGATGACCATCAATGGGCAACAGCTGCAAGTGCTGCAAATGAACAACAGTCCCCACATGATTCAAGGTCCTAATGGACAACAAATAGTCATACATGCTATACCATCAACAGCACCGACCATACAGGTGGCAACCCCGACTGGTCAGCAATTACAGCAACTACAAGTACTACCACTATCTAGTTTACAAGGTACAAGCAACAATATACAGCCGATGCAGTTAGTTCAGACTCCTGACGGACAAACATACATCTATCAACCCACAGCAAGCGCACCACAACCACAGATTGACCAACATCAGATTATACACCAACCCGGCACCCTATTGAACTTGAATGGAAATTTACTACAAGTGGCAGGTGTTGGCGGAGTACAACCACAACAAATTGTCAATCAGCCGAACATAGTCATGATGGTAAATGGCAACACAGCCTCATCGAGCGAAGGGGCCACATCCACAGCGGGGTCTGACGAGGAGCCGTTACTGTATGTCAACGCCAGGCAATACAAACGTATACTGAAGAGACGAGCAGCTAGGGCTAAATTACACGAACAAGGGAAAATACCTAAAGAAAGACCTAAATATCTTCATGAGTCGAGACACAGGCACGCCATGAACAGGATAAGAGGTGAAGGTGGGAGATTTAATTCCGGCAGCAGGAAGAATATGGAACAACAGGAACAGAATACGTCCACCCAGGCTATATTGGATGACATCAAGCCCGATACAGTCTCTATAACAATAATTCAGGATGAAGAATTACAAGAGACCCAAACGAATCAATGGCGAAGACTGGCCCCACAACCGATGACATCGACATAG

Protein sequence:

>DPOGS213780-PA
MEHAQIQTSDGTQVMTINGQQLQVLQMNNSPHMIQGPNGQQIVIHAIPSTAPTIQVATPTGQQLQQLQVLPLSSLQGTSNNIQPMQLVQTPDGQTYIYQPTASAPQPQIDQHQIIHQPGTLLNLNGNLLQVAGVGGVQPQQIVNQPNIVMMVNGNTASSSEGATSTAGSDEEPLLYVNARQYKRILKRRAARAKLHEQGKIPKERPKYLHESRHRHAMNRIRGEGGRFNSGSRKNMEQQEQNTSTQAILDDIKPDTVSITIIQDEELQETQTNQWRRLAPQPMTST-