Monarch geneset OGS2.0

DPOGS206528
Transcript: DPOGS206528-TA, 2250 bp
Protein: DPOGS206528-PA, 749 aa
Genomic position: DPSCF300190 - 275579-281160
RNAseq coverage: 298x (Rank: top 37%)
Annotation
Heliconius        HMEL002297         0.0      82.38%
Bombyx            BGIBMGA005916-TA   0.0      75.78%
Drosophila        kat80-PB           4e-69    38.84%
EBI UniRef50      UniRef50_D6WK16    0.0      43.51%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WK16_TRICA
NCBI RefSeq       XP_966378.2        6e-175   42.05%   PREDICTED: similar to katanin p80 subunit, partial [Tribolium castaneum]
NCBI nr blastp    gi|270007486       0.0      43.51%   hypothetical protein TcasGA2_TC014074 [Tribolium castaneum]
NCBI nr blastx    gi|270007486       0.0      43.51%   hypothetical protein TcasGA2_TC014074 [Tribolium castaneum]
Group
Gene Ontology     GO:0005515              2.9e-77   protein binding
KEGG pathway      ago:AGOS_AEL246C        1e-33
                  K03130 (TFIID4, TAF5)   maps-> Basal transcription factors
InterPro domain   [10-308] IPR015943      2.9e-77   WD40/YVTN repeat-like-containing domain
                  [13-295] IPR011046      9.7e-73   WD40 repeat-like-containing domain
                  [133-172] IPR001680     5.2e-12   WD40 repeat
                  [136-172] IPR019781     7e-12     WD40 repeat, subgroup
Orthology group   MCL13590                Single-copy universal gene

Nucleotide sequence:

>DPOGS206528-TA
ATGGAAACCTTAAGAAGATCATGGAAATTACAAGAATTTGTTGCCCACAAGGCAAACGTCAACTGCTTGGCTATGGGACACAAATCAAATCAAGTCCTTGCTACTGGTGGAGATGACAAAAAAGTTAATCTTTGGGCTATTGGTAGACAAGGGTGTTTAATGAGCCTTAGTGGTCACACAACTCCAGTAGAGTGTGTTTGCTTTGGTCACTCTGAGGACTTAGTTTGTGCTGGTTCTCAAACAGGTGCCTTAAAAATATGGGATTTGGAAGCTGCAAAATTATTAAGAACATTCACGGGACATAAAGGTGCTATTAAATGTATGGATTTCCACCCCTATGGGGATTATTTGACAACTGGATCATGTGACAGTAATATCAAACTATGGGATACGAGGAAAAGAGGTTGCATTGTTACATATTCAGGTCATAGACTTGCAGTGAACAGTCTACAATTCAGTCCAGATGGTCAATGGATAGCATCAGCCTGTGAAGATGGTTTGGTAAAAGTGTGGGATGTTCGGATTGGAAAGGTTTTACAAGAGTTCATGGAGCACACTTCAGCTGTGACCTGTGTCAAGTTCCACCCACATGAATTCTTGCTCGCCAGCTGCGGTGCTGACAAAACAGTAAATTTTTGGGACATGGAAAAATTCCAACTAGTATCTAAATTTGAAAAAGAGAACACATCAATAAGGCACATGGTGTTCAGTGATGATGGAGCTACGCTATTAGGATGTGGCAATGATGGGCTACATGTCATAGGATTTGAACCTGCTAGGGTTTTGGACACAGTGAATGGACACTGGGGTCATATACATGACATAACAGTGGCACAGACACAACTTATCGCGGGTTCGTTTCATTCAACCTATGTTGTTTTATCTGTGGTTGATCTCAACAAAGTCCATCCCTTTGGAGGTCCGCCGCCAACTATTGTTAGAGACACCTCGCCATTCCAGAAAGGACAGTCAGTTCGCAAGAGTTTTTCTAAAGAGAAACCGCCTAAAGAAGTCTTGCATAGACCAACACTCCTCGATGAGAGGACGGCAGAGGAATCGACTTCGGGGACGGAGGCAGATGAAGATTCAGGTGCTGTTATAGCTAATATTAATGACTACACTGAAATATTTCGTCCCTCTAGAGCCTTGCCGCGTACACCGCCTCCGGCCACCAGTTTATCATCGGAGGATTTCTCATTGAGTGGTCTAACAGGTGAAGATAACAATCTCGAATCGGGTGCAGCACTTCGGGACCTATCGTTGACACGGCGGGACTACGGAACCGAAAACACTGTCTTTAGTACAGTACTGAAAAGCAATAAGGAAGAGAAGGTTTTCGCCTCCGCCCAGACATCTCTTGTAAAAGAATTCGCAACCAGCCCACTAACAACCTCATCATTAAATAGACATAACTCATATAAAGAGACAAAATCATCAACTGATATAACCTCGTCGAACCTGCGTCAGAGTAATAGCGAGGTATCTTTGGGACCGCCTTCTCTCGCCGGTGCAAGAAATAGTAAGTGCAGACCCCCTAGTCAAATACCCAGATCCCGCGTGGAGCCTCCGCCGCCTCGGTCTCCTGAAGATAGAGTGCCGGAGCCCGAGTTCGTGCCTTACTCCATCGACCGACCAGTCGGACTGGACTTGGATGAATTCCTACCTCGGGGTGCGTGTGCGTCTGGCGTGGGTCGCGGCGCTCGAGGTCACGCTGCCGAGCCTAGTGAGCAGGAAGTGTTAGGGGTCATGATGAGAGGACACGACTCCATGATGACGGTACTGGCCGCCAGACAAAGAGCTCTACAGATATTTCACTCCGTCAGAATAAACAAAAGCCTAAAATCGGCACTGGACTCGGTGATCGCTTTAGAAGACGCGTCTGTGATACTCGACATTCTTAACGTGATGGCTCATAAACCGTCTTTATGGAATTTGGACATATGTTTATTAATGTTGCCCAAGATTTACGAACTGCTGCAGAGCAAATATGAATCGTATAT
GCAATGCGGGTGCAACGCTTTGAGACTAATAGTACGCAACTTCTCCTCTGTGGTGCGAGCTAACGTGAGCGCGCCAGTGAGGACCCTAGGTGTCGATATACCAAGGGAAGAAAGATACGCCAAATGCGCGCAAATACACAGACTGTTGCTAGACATACGAGCGTTCCTTCTCAAAAGACAAACCCTACAAGGCCGACTCGGAGCTGCCTTCAGGGATCTCCACACGCTGATGCAACAAGGACTTGACTGA

Protein sequence:

>DPOGS206528-PA
METLRRSWKLQEFVAHKANVNCLAMGHKSNQVLATGGDDKKVNLWAIGRQGCLMSLSGHTTPVECVCFGHSEDLVCAGSQTGALKIWDLEAAKLLRTFTGHKGAIKCMDFHPYGDYLTTGSCDSNIKLWDTRKRGCIVTYSGHRLAVNSLQFSPDGQWIASACEDGLVKVWDVRIGKVLQEFMEHTSAVTCVKFHPHEFLLASCGADKTVNFWDMEKFQLVSKFEKENTSIRHMVFSDDGATLLGCGNDGLHVIGFEPARVLDTVNGHWGHIHDITVAQTQLIAGSFHSTYVVLSVVDLNKVHPFGGPPPTIVRDTSPFQKGQSVRKSFSKEKPPKEVLHRPTLLDERTAEESTSGTEADEDSGAVIANINDYTEIFRPSRALPRTPPPATSLSSEDFSLSGLTGEDNNLESGAALRDLSLTRRDYGTENTVFSTVLKSNKEEKVFASAQTSLVKEFATSPLTTSSLNRHNSYKETKSSTDITSSNLRQSNSEVSLGPPSLAGARNSKCRPPSQIPRSRVEPPPPRSPEDRVPEPEFVPYSIDRPVGLDLDEFLPRGACASGVGRGARGHAAEPSEQEVLGVMMRGHDSMMTVLAARQRALQIFHSVRINKSLKSALDSVIALEDASVILDILNVMAHKPSLWNLDICLLMLPKIYELLQSKYESYMQCGCNALRLIVRNFSSVVRANVSAPVRTLGVDIPREERYAKCAQIHRLLLDIRAFLLKRQTLQGRLGAAFRDLHTLMQQGLD-