Monarch geneset OGS2.0

DPOGS214934
TranscriptDPOGS214934-TA1509 bp
ProteinDPOGS214934-PA502 aa
Genomic positionDPSCF300280 - 192876-194384
RNAseq coverage675x (Rank: top 19%)
Annotation
HeliconiusHMEL0155980.073.45% 
BombyxBGIBMGA004818-TA0.073.19% 
Drosophiladan-PA1e-4931.21% 
EBI UniRef50UniRef50_Q16IB45e-5431.47%Protein distal antenna n=1 Tax=Aedes aegypti RepID=DAN_AEDAE
NCBI RefSeqXP_001663918.11e-5431.47%Psq-DNA binding domain protein, putative [Aedes aegypti]
NCBI nr blastpgi|2065579382e-5331.47%RecName: Full=Protein distal antenna
NCBI nr blastxgi|2065579389e-5832.83%RecName: Full=Protein distal antenna
Group
Gene OntologyGO:00036771e-19DNA binding
GO:00007751e-19chromosome, centromeric region
GO:00055151.8e-15protein binding
GO:00063551.2e-06regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[6-57] IPR0066951e-19Centromere protein Cenp-B, DNA-binding domain 1
[5-68] IPR0090571.8e-15Homeodomain-like
[16-63] IPR0122871.2e-06Homeodomain-related
Orthology groupMCL18292 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214934-TA
ATGACAACGAAGGGAAAGCGTCCTATGCGCGCCCTCACACCCGGAGATAAGATCGAGGCCATACAGAGGGTCAACGACGGCGAGTCCAAAGCCTCGGTCGCTCGTGACATAGGAGTGCCCGAGTCCACGCTGCGGGGCTGGTGCAAGAATGAGGACAAGCTCCGCTACATGACCTCGAGGTTGTCCTCCCCCGACACCGACAAGAGCAACGACGGGGAGCCGCCGGACAAGCGCGCGCGCACCGAGTCCCCGACAGCTCCTCAGTCACCCATCAACACCGGCCTGGATCTTTCTAGCGCGGTCTCCGTTACTCACAGCACCACCCAGCCTCCGCCGCAGGCCGATGTCCCCGTCGAGCTCACTACCAAGCGCAGCGAGCCCTCGCCCCCACTCCATCCGCCACGAGAGCGCCGACCGGACCCCGGAGCCAGCGTCTCCATGAGCGCCATCAGCCCGCTATCGGGATTGGCCCATCTACCAGGACTTACACATTCTCACCTCGGATTGAGCTTCAATGAAATCGCAAACAACCTAACACTCCTCGCTCAACTGAACCCTGGACTGTCGACGCTGTCGGCGCAGCCGGCGAGCAGAGCGCTGCGGTCTGTGCGCTCGCCGAAGCCAGCTCACAACGGAGTGCTTAACTTGAACGAAAACAAACATCGCAGCAAATCAAACCACTCATCGGACCCGTACAGACACAGCGGGTCCAAGTCGAGTCATCACACTACTTCGCAATCAGCGTCTCAGCCCGTCGACGACACGCTGTGGTACTGGCTCAAAACTCAACAGGCCATGCTGGATTTGACTTCCCAAACAACGGCTCATCCGTTGCAATTAGGAAAAACTAGCGATCCCACTCTGCCGCCTAAGCCCGTGGCGCCCACGCCGCCCGTCAGTTCGCACCTCGACTACAACAGAAACTCTTGGCTGTGGCAGTACTACAAACAGTTCGGTGGAGCCATGCCGGTTCCGGAAGACAAGCACAAGCCGGCGTCACAGGTACCGAAAGACAAGTCCGGAGACATCTTGTTCTCGCATTTAACTAAAGCGAAGCCGGAGGACGACCGGAGCATCATTAGTCCAGACCAGAGCCAAACTCTGTCGGCTAAAGTCCGCGAGACAGTCCCGCCGCCGCTCCCGGCCGCTCCCGCAGAACCTCGAGTGGCCGAGCCCGCCTCGAGCCCGGACGTCGGCACGGAGAATAAGGAACCCTCAGTAGAGAAACCCATCGAGTCCGGCAGAAGCCAAACCAAGGCCAGAAACGTGCTCGACAACTTACTGTTCAACAGCAGCCAAGCGGCCAACGAAGAGAATAAGAGCAACGGCTCCACGAACGGCGAGTGGGAGGCGGGCACGGTGGAGGCGCTGGAACACGGAGACAAGTTCCTCGCGTGGCTGGAGGCCAGCGGCGACCCGAGCGTCACCCGCATGCACGTGCATCAGCTCCGAGCACTGCTCCACAACCTCCGCACGCGCCGCGCCGCGCCCGACGCACGCCGCAAGTAA

Protein sequence:

>DPOGS214934-PA
MTTKGKRPMRALTPGDKIEAIQRVNDGESKASVARDIGVPESTLRGWCKNEDKLRYMTSRLSSPDTDKSNDGEPPDKRARTESPTAPQSPINTGLDLSSAVSVTHSTTQPPPQADVPVELTTKRSEPSPPLHPPRERRPDPGASVSMSAISPLSGLAHLPGLTHSHLGLSFNEIANNLTLLAQLNPGLSTLSAQPASRALRSVRSPKPAHNGVLNLNENKHRSKSNHSSDPYRHSGSKSSHHTTSQSASQPVDDTLWYWLKTQQAMLDLTSQTTAHPLQLGKTSDPTLPPKPVAPTPPVSSHLDYNRNSWLWQYYKQFGGAMPVPEDKHKPASQVPKDKSGDILFSHLTKAKPEDDRSIISPDQSQTLSAKVRETVPPPLPAAPAEPRVAEPASSPDVGTENKEPSVEKPIESGRSQTKARNVLDNLLFNSSQAANEENKSNGSTNGEWEAGTVEALEHGDKFLAWLEASGDPSVTRMHVHQLRALLHNLRTRRAAPDARRK-