Monarch geneset OGS2.0

DPOGS211172
TranscriptDPOGS211172-TA2094 bp
ProteinDPOGS211172-PA697 aa
Genomic positionDPSCF300007 + 317857-323613
RNAseq coverage162x (Rank: top 52%)
Annotation
HeliconiusHMEL0172290.083.47% 
BombyxBGIBMGA003157-TA0.088.21% 
DrosophilaCG11399-PB0.057.32% 
EBI UniRef50UniRef50_E2A9L90.067.92%Phosphorylated CTD-interacting factor 1 n=11 Tax=Neoptera RepID=E2A9L9_CAMFO
NCBI RefSeqXP_624144.10.068.45%PREDICTED: similar to CG11399-PB [Apis mellifera]
NCBI nr blastpgi|3407269280.070.02%PREDICTED: LOW QUALITY PROTEIN: phosphorylated CTD-interacting factor 1-like [Bombus terrestris]
NCBI nr blastxgi|3072080750.068.21%Phosphorylated CTD-interacting factor 1 [Harpegnathos saltator]
Group
Gene OntologyGO:00055156.3e-07protein binding
KEGG pathway 
InterPro domain[415-593] IPR0220351.9e-62Phosphorylated CTD interacting factor 1, WW domain
[64-89] IPR0012026.3e-07WW/Rsp5/WWP
Orthology groupMCL14327 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211172-TA
ATGAATGATGTTCGCGAAAAAGTTGTTGCTGGGTCTTCGACATGGGAAAGTGCTCCGCCGCATAGCGACTCATCTCCTGAAATTCATTCATCCCCCGGAGGCAGCGGGGAAACTCCTCAAACTCCTGGAGCCCCAGCTCCTCTTGCACCGCATCTTGCTGCAGATTTGCATCCAGATTTAATACAACAAGGTTGGCAAAAATATTGGTCAAGGCGTGAAAACAGACCTTATTTTTGGAACAAAATTTCTGGCGAGTCTATGTGGGAGTTACCAGTGGTGAAAAGAGATTTTGACCCAATAACTGACCCCTTAGGTATATGTCACACAGGTCCACCCAATGCTGGGAATGTCACTCCAAGTGCTAGTAAACGACGTCCATCTGAAGACAATGGGCCTCCACCAAAAAAGTTTGTTCTTGCCGGACCTTGGGATATTGAAGTCCCAACTAATGTCATAATATATGAACGGCCACCAACTATTTGCCCTCATCCTCATCCAGAGATTGAAGGATTTAGATTTACATTAGCTAATAAGTTAAGGCAATGTTATCAGGAATTGTGTCACACTAGAGAGAGTATAGACGCCCCAAAAGATTCTTTTAATAGATGGCTGATGGAGAGAAAGGTGAACGACCAGGGCGGTTCAGACCCATTACTTCCTAGCCATTGTTTTCCTGAAATATCTCATTCAATGTACGAAGAAATTATGAATGATATACCCTTAAAATTGGCTCATCCAAAATTTACTGGAGATGCTCGAAAACAACTCTCAAGATATGCGGAAGCAGCAAAAAAAATGATTGAATCTAGAAATGCATCTCCAGAAAGTAGAAAAGTTGTGAAGTGGAACGCCGAAGATACTTTCCAGTGGCTGAGACGGACCGTGGGTGCAACTTATGACGACTTCCAAGACAGATTGGCACATTTACGGAGACAATGTCAACCACACCTGGCAGAAACAGCAAAAGCATCGGTGGAAGGAATTTGTCTTAAGATTTATCATTTGTCAGCCGAATACGCAAGAAAGATAAGAGAAAAACATAGTGTGTTATTAAAAGAAAATGGTATACAGGAGTTAGCGGCCCCGCTTCAACAGGCTGCTCTTCGTAAGGTGTGGTGCTATCCAGTGCAATGTGCACTGCCTTCACCGAGGCCACCCCTAGTGGAACACTTTCTGGACCGAGACCAAGTGTTGCTGAGATATCTCGGGGAGACACAAGTCATCAACGCCAATTACCTACAAAAGCTCGAGCAATTGTACCGCTACAGCTGTTTCGATGACAAGAAGTTCGAGCAGTTCCTGTCCCGCGTGTGGTGCCTCCTGAGGCGGTACGCGGCGTGGGTGGGTGGAGCGGGGGCCGGGGTCAACGACTCCCACGTCACACAGATGGCGCTGCCGGTGCCGGTGCTCGATTGCCTCCATCGATACTTCGGCGTGACCTTCGAATGCTTCGCCAGCCCCCTCGACTGCTATTTCAGACAATACTGCTCTGCGTTTGCGGATACTGATTCCTATTTCGGCTCCCGAGGCCCTTTCCTTGAGCTCCGGCCCGTGTCGGGGTCGTTGGTGGCGCACCCCCCTTACTGCGAGGAGCTCCTGGCTGCGGCGCTGCGACACATGGAACGCCTGCTGCAGGACTCCGCGGAGCCTCTCAGTTTCGTGGTAGTACTCCCCGAGTGGCCTGACAAACAGACACACGCACTGCACAAACTGCAGGCCAGCCACTTTAAGAGGAAACAGGTGGTCATACCAGCGTTTGAGCACGAGTATCGCCACGGTTTCCAACATGTACTTCCAAACCCTTTCCTTGAGCTCCGGCCCGTGTCGGGGTCGTTGGTGGCGCACCCCCCTTACTGCGAGGAGCTCCTGGCTGCGGCGCTGCGACACATGGAACGCCTGCTGCAGGACTCCGCGGAGCCTCTCAGTTTCGTGGTCGTACTCCCCGAGTGGCCTGACAAACAGACACACGCACTGCACAAACTGCAGGCCAGCCACTTTAAGAGGAAACAGGTGGTCATACCAGCGTTTGAACACGAGTATCGCCACGGTTTCCAACATGTACTTCCAAAAAACGAAGTTTATTTCCGTTCATAA

Protein sequence:

>DPOGS211172-PA
MNDVREKVVAGSSTWESAPPHSDSSPEIHSSPGGSGETPQTPGAPAPLAPHLAADLHPDLIQQGWQKYWSRRENRPYFWNKISGESMWELPVVKRDFDPITDPLGICHTGPPNAGNVTPSASKRRPSEDNGPPPKKFVLAGPWDIEVPTNVIIYERPPTICPHPHPEIEGFRFTLANKLRQCYQELCHTRESIDAPKDSFNRWLMERKVNDQGGSDPLLPSHCFPEISHSMYEEIMNDIPLKLAHPKFTGDARKQLSRYAEAAKKMIESRNASPESRKVVKWNAEDTFQWLRRTVGATYDDFQDRLAHLRRQCQPHLAETAKASVEGICLKIYHLSAEYARKIREKHSVLLKENGIQELAAPLQQAALRKVWCYPVQCALPSPRPPLVEHFLDRDQVLLRYLGETQVINANYLQKLEQLYRYSCFDDKKFEQFLSRVWCLLRRYAAWVGGAGAGVNDSHVTQMALPVPVLDCLHRYFGVTFECFASPLDCYFRQYCSAFADTDSYFGSRGPFLELRPVSGSLVAHPPYCEELLAAALRHMERLLQDSAEPLSFVVVLPEWPDKQTHALHKLQASHFKRKQVVIPAFEHEYRHGFQHVLPNPFLELRPVSGSLVAHPPYCEELLAAALRHMERLLQDSAEPLSFVVVLPEWPDKQTHALHKLQASHFKRKQVVIPAFEHEYRHGFQHVLPKNEVYFRS-