Monarch geneset OGS2.0

DPOGS212701
Transcript        DPOGS212701-TA  1095 bp
Protein           DPOGS212701-PA  364 aa
Genomic position  DPSCF300012 - 810111-829811
RNAseq coverage   278x (Rank: top 39%)
Annotation
Heliconius      HMEL015536        2e-166  84.70%
Bombyx          BGIBMGA013132-TA  4e-121  85.20%
Drosophila      org-1-PA          5e-84   66.20%
EBI UniRef50    UniRef50_E0W3M9   1e-92   76.62%  T-box protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0W3M9_PEDHC
NCBI RefSeq     XP_002432973.1    2e-93   76.62%  T-box protein, putative [Pediculus humanus corporis]
NCBI nr blastp  gi|242025118      4e-92   76.62%  T-box protein, putative [Pediculus humanus corporis]
NCBI nr blastx  gi|91084303       1e-89   55.52%  PREDICTED: similar to T-box protein Tbx1 [Tribolium castaneum]
Group
Gene Ontology    GO:0005634  3.5e-139  nucleus
                 GO:0006355  3.5e-139  regulation of transcription, DNA-dependent
                 GO:0003700  3.5e-139  sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain  [37-304]   IPR001699  3.5e-139  Transcription factor, T-box
                 [112-299]  IPR008967  4.3e-74   p53-like transcription factor, DNA-binding
Orthology group  MCL15727  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212701-TA
ATGGAGAGCCAAGAGTGGCGCGAGGACTGGCACCAGCCGAGACAGATGGACGGCGTTGTGTTCAATCAGTTCCCGCGAGGTGGTTTCCAACTGCAGGCTTTAGCTGAGCGAGTCAGCCGCGACCAGGAACACCTGCCCGTACTGCCGCCGCTTTACAACGTAGTACGAGACACTGCTAGCTGTTCGCGGAGTACATACTCTCCGCTACAGGCGCTCAGTGAGAGCACGTGCCGAGACCACGCTGCGCCCGCGCCGCCCCCGCCCCCGCCACGTGTTCAGGAGGCCCACTCGCAGGTCCCGAATCAAAGCGTGACCCTCCACCCAGCTGTAGCTCGTTGCAGCGCTTCCTTGGAGCTATCAGCGTTATGGCGGAGCTTCCACGAGCTCGGGACGGAGATGATAGTGACGAAGGCCGGCAGACGGATGTTCCCAGCGCTCCAGGCGAGGCTCTCCGGTCTACTGCCCAATGCTGATTATCTACTGCTGGTGGATTTCGTACCGCTGGACGACAAAAGATACAGATACGCCTTCCACAGTTCGAGCTGGGTCGTGGCTGGCAAGGCCGACCCAGTGTCTCCGCCTCGTATCCACGTACACCCTGACTCGCCAGCGGCCGGAGCACACTGGATGAGACAGCTCGTCTCTTTCGACAAACTTAAATTGACAAACAATCAGTTGGACGACAATGGACACATAATCCTGAACTCGATGCACCGCTACCAGCCCCGGCTGCACGTGGTGTTCCTACCCGGAGACGGGCAGAGCGCCCCGGGGACGGTCCCCTACAGGACCTTCATCTTCCCGGAGACAGGGTTCACAGCGGTCACCGCCTATCAGAATCATCGCATAACTCAATTGAAGATAGCCAGCAATCCGTTCGCTAAAGGCTTCAGAGACTGCGATCCCGACGACTGTCCACCAGAGCCTGGCGGACAACGGGCCCCTCGGAGGCGCGAGGAGGGTCCGCTAGCGCAGCCCTACGCCGCTGAACCCTCGCGGCCGCCCGGCAACATGCCGCCCCACGCGCACACCGTAAGATACCAACCTCACTCAAGTCACAACAGCTCGTACACAGCGTATTACGCTCACAGATAA

Protein sequence:

>DPOGS212701-PA
MESQEWREDWHQPRQMDGVVFNQFPRGGFQLQALAERVSRDQEHLPVLPPLYNVVRDTASCSRSTYSPLQALSESTCRDHAAPAPPPPPPRVQEAHSQVPNQSVTLHPAVARCSASLELSALWRSFHELGTEMIVTKAGRRMFPALQARLSGLLPNADYLLLVDFVPLDDKRYRYAFHSSSWVVAGKADPVSPPRIHVHPDSPAAGAHWMRQLVSFDKLKLTNNQLDDNGHIILNSMHRYQPRLHVVFLPGDGQSAPGTVPYRTFIFPETGFTAVTAYQNHRITQLKIASNPFAKGFRDCDPDDCPPEPGGQRAPRRREEGPLAQPYAAEPSRPPGNMPPHAHTVRYQPHSSHNSSYTAYYAHR-
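
The two records above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript record is a pure coding sequence translated with the standard genetic code and that the trailing "-" in the protein record marks the stop codon. The names translate and CODON_TABLE are illustrative and not part of the database. Under those assumptions, the listed lengths agree: 364 residues plus one stop codon is 365 codons, or 1095 bp.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (standard code assumed).

    Stop codons are written as stop_symbol, matching the "-" that ends
    the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First six and last three codons of DPOGS212701-TA, copied from the record.
first_codons = "ATGGAGAGCCAAGAGTGG"
last_codons = "CACAGATAA"

print(translate(first_codons))  # start of the protein record
print(translate(last_codons))   # end of the protein record, including stop

# Length check: (residues + 1 stop codon) * 3 should equal the transcript length.
print((364 + 1) * 3 == 1095)
```

Passing the full nucleotide record to translate should reproduce the whole protein record if the assumptions hold; a mismatch would point to a non-standard code or a transcript that includes untranslated regions.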