Monarch geneset OGS2.0

DPOGS202772
Transcript: DPOGS202772-TA  1764 bp
Protein: DPOGS202772-PA  587 aa
Genomic position: DPSCF300018 - 1073280-1076042
RNAseq coverage: 250x (Rank: top 42%)
Annotation
Heliconius: HMEL002685  0.0  66.90%
Bombyx: BGIBMGA010495-TA  0.0  63.26%
Drosophila: Caf1-105-PA  4e-113  48.79%
EBI UniRef50: UniRef50_Q16HR3  5e-128  57.95%  Chromatin assembly factor i P60 subunit n=4 Tax=Pancrustacea RepID=Q16HR3_AEDAE
NCBI RefSeq: XP_308335.3  2e-129  54.20%  AGAP007544-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|158285483  3e-128  54.20%  AGAP007544-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|158285483  4e-126  46.03%  AGAP007544-PA [Anopheles gambiae str. PEST]
Group:
Gene Ontology: GO:0005515  1.1e-50  protein binding
KEGG pathway:
InterPro domain: [10-381]  IPR015943  1.1e-50  WD40/YVTN repeat-like-containing domain
                 [13-380]  IPR011046  5.7e-47  WD40 repeat-like-containing domain
                 [115-154]  IPR001680  1.5e-07  WD40 repeat
                 [118-154]  IPR019781  1.8e-07  WD40 repeat, subgroup
Orthology group: MCL12906  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202772-TA
ATGAAGTTTGCTATACCTGAAATATCATGGCATAACAGAGATCCAGTTTTAAGTGTAGACATTCAGCCCAAAACAAATGCAAGTGAACCACTGCGGTTAGCTACCGGGGGCACAGATTCTCATGTTGTGATATGGTATTTATCAAAAACAATAACCGGTTCAGTGAAATTAGAAGTCGCTACTGATCTCACCAGGCATCAAAAAGCCGTTAATGTAGTGAGATGGTCGCCCAATGGTGTCTACTTAGCATCTGGAGATGATGAATCTATCATATTTATATGGAAGCAAAAGACGGAAGAGCCAATAGCACCACCCTTAGAGGGAGAGGAGCAGTATAAAGAGACTTGGGTTATACATAAAACTTTAAGGGGTCACATGGAGGATGTTCTGGACATCAGTTGGAGTAGTTCATCACTACATTTGGCATCCGGCTCAGTAGACAACAAGCTGATTGTCTGGGATGTGGCGAGAGCTCGATCTAGTGGTATTGTCTCTGATCATAAAGGCTTTGTCCAGGGAGTAGCATGGGACCCTCAAGGACAGCTGATAGCCACAGCTAGCTCGGATAGAGTTTTCCGAACATTTGATGTGGGGACTAAGAAAGTGTTGTCTCGTAGCAGTAAGGCTATTCTACCGTTCCCTAAGGAGCATACCCTACATGAAGTGAAGGTCCGCCTCTACCATGACGACACTCTACAGACGTACTACAGGAGATTACATTTCAGTCCCGATGGAATGTTCATTGCTGTGCCGGCCGGAAGAATAGAACCAGAACAAGGCAAACTGGACATTAAACCAATGAATGCTGTTTACATTTACACTAGACACTCTCTCAAAACTCCTGCGTGTGTGGTTCCGTGTGGAGAGCCGGCGCTGGTGTGCCGCTGGTCGCCCGTGCGTCGTGCGGCGCGGACTTCGCCCCCCGCGCCGTCTGCTTTGCAGCACGCCCCTCGGCTTCTGCTGGCGGTGGCCACGCGGAGATCGCTGCTGTTGTACGACACGCACCAGAAAGCGCCCGTCGCGCTCATCTCAAACATACACTACACCAGGATCACAGACCTTTCGTGGTCTTCCGACGGCCTGACCCTAGTGGCCTCCAGCACTGACGGTTTCTGCTCCGTCGTCAGTTTCACCGAGGAAGAGCTGGGCGAGGCGCTCACCACCGCGGACGCCGTTAGTGCAGAGCCGATGGAAACGGAGGAACAGAAACATAACCAAGAAACTCCTAAACAGAGACACGCTGAGGCGAAACCCATAGAAGTCAAGCGGAGGCCGTCCTCGAACAACACCAAAATAGACGCCTTCATTAAGTTTAAAACTCCCGAAGATAAGTCTCCGAAGAAGAAGAAGATCGAAAACATTCAGCAGAAGACGCCCGTCAAGATGGACGTCCTCATGGAGACCGCGCTGCCATCCTGGTCTGACAACTCCAGCAACGACCTCATCAGACCCAAGGACACGGAGACCGCGACCCTCGGCGACGAAAATGACGTCACCGTCATAGAGGACAGCGAGGACATCCAGCTGGTCTACGAGGAGACCAAGGACGGCCAGTCGCCCAAGACGGAACCCTCGGAGGAAAAACCTGCTCCCAAGACGATGTCTCCCAAACAATGCGGCACGGCCGACAGCAACTTCCTAATGAAGGCAAAGATCACCGACATCAGGGAGCCGGCGCCGCTCACCGCCGTGCCGAGTCCCAAGGCACCGCGGAGAGTCAGCTTCGTGACGCTGTCGAGTCCTAAGAGCACGAAAAAAAAATAA

Protein sequence:

>DPOGS202772-PA
MKFAIPEISWHNRDPVLSVDIQPKTNASEPLRLATGGTDSHVVIWYLSKTITGSVKLEVATDLTRHQKAVNVVRWSPNGVYLASGDDESIIFIWKQKTEEPIAPPLEGEEQYKETWVIHKTLRGHMEDVLDISWSSSSLHLASGSVDNKLIVWDVARARSSGIVSDHKGFVQGVAWDPQGQLIATASSDRVFRTFDVGTKKVLSRSSKAILPFPKEHTLHEVKVRLYHDDTLQTYYRRLHFSPDGMFIAVPAGRIEPEQGKLDIKPMNAVYIYTRHSLKTPACVVPCGEPALVCRWSPVRRAARTSPPAPSALQHAPRLLLAVATRRSLLLYDTHQKAPVALISNIHYTRITDLSWSSDGLTLVASSTDGFCSVVSFTEEELGEALTTADAVSAEPMETEEQKHNQETPKQRHAEAKPIEVKRRPSSNNTKIDAFIKFKTPEDKSPKKKKIENIQQKTPVKMDVLMETALPSWSDNSSNDLIRPKDTETATLGDENDVTVIEDSEDIQLVYEETKDGQSPKTEPSEEKPAPKTMSPKQCGTADSNFLMKAKITDIREPAPLTAVPSPKAPRRVSFVTLSSPKSTKKK-
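
The transcript and protein lengths listed in this record (1764 bp and 587 aa) are consistent if the transcript is a complete coding sequence ending in one stop codon: 3 x (587 + 1) = 1764. The Python sketch below illustrates that check and a simple translation. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon; the function names `translate` and `expected_transcript_length` are hypothetical helpers, not part of any database tooling.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" by assumption,
    to match the notation used in this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        protein.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(protein)


def expected_transcript_length(protein_length):
    """Length in bp of a CDS encoding protein_length residues plus one stop codon."""
    return 3 * (protein_length + 1)
```

For example, `translate("ATGAAGTTTGCTATACCTGAAATATCATGG")`, the first 30 bases of DPOGS202772-TA, gives `MKFAIPEISW`, which matches the start of DPOGS202772-PA, and `expected_transcript_length(587)` gives 1764.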