Monarch geneset OGS2.0

DPOGS200999
Transcript: DPOGS200999-TA, 3060 bp
Protein: DPOGS200999-PA, 1019 aa
Genomic position: DPSCF300147 - 159518-166184
RNAseq coverage: 236x (Rank: top 43%)
Annotation
Heliconius: HMEL013789, 3e-112, 45.58%
Bombyx: BGIBMGA009069-TA, 5e-126, 60.23%
Drosophila: CG9004-PA, 6e-114, 43.68%
EBI UniRef50: UniRef50_F4WKN2, 1e-157, 41.21%, Nucleolar MIF4G domain-containing protein 1 n=6 Tax=Formicidae RepID=F4WKN2_ACREC
NCBI RefSeq: XP_001600662.1, 2e-151, 41.35%, PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastp: gi|332024937, 4e-157, 41.21%, Nucleolar MIF4G domain-containing protein 1 [Acromyrmex echinatior]
NCBI nr blastx: gi|189234299, 2e-164, 42.07%, PREDICTED: similar to CG9004 CG9004-PA [Tribolium castaneum]
Group
Gene Ontology: GO:0005488, 1.1e-30, binding
Gene Ontology: GO:0016070, 4.3e-19, RNA metabolic process
Gene Ontology: GO:0005515, 4.3e-19, protein binding
KEGG pathway 
InterPro domain: [352-565] IPR016024, 1.1e-30, Armadillo-type fold
InterPro domain: [652-758] IPR003891, 9.5e-23, Initiation factor eIF-4 gamma, MA3
InterPro domain: [363-562] IPR003890, 4.3e-19, MIF4G-like, type 3
InterPro domain: [354-563] IPR016021, 3e-07, MIF4-like, type 1/2/3
Orthology group: MCL13856 Single-copy universal gene
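
The InterPro spans listed above can be compared programmatically, for example to see which hits are nested inside one another. The sketch below assumes the bracketed ranges are 1-based, inclusive residue coordinates on the 1019 aa protein (the record does not state this). The helper names `span_length` and `contains` are illustrative and not part of any database API.

```python
# InterPro spans copied from the record above. Coordinates are ASSUMED
# to be 1-based and inclusive; the record itself does not say so.
domains = {
    "IPR016024": (352, 565),  # Armadillo-type fold
    "IPR003891": (652, 758),  # Initiation factor eIF-4 gamma, MA3
    "IPR003890": (363, 562),  # MIF4G-like, type 3
    "IPR016021": (354, 563),  # MIF4-like, type 1/2/3
}


def span_length(span):
    """Number of residues covered by an inclusive (start, end) span."""
    start, end = span
    return end - start + 1


def contains(outer, inner):
    """True if the inner span lies entirely within the outer span."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


lengths = {acc: span_length(span) for acc, span in domains.items()}

# Accessions whose span falls inside the Armadillo-type fold hit.
nested_in_arm = sorted(
    acc
    for acc, span in domains.items()
    if acc != "IPR016024" and contains(domains["IPR016024"], span)
)
```

Under that assumption, the two MIF4G-related hits fall inside the Armadillo-type fold span, while the MA3 hit is a separate, more C-terminal region.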
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200999-TA
ATGAAAAAACCAAATAATAAGCCAAAATTTACAAATACACGTAAAGTGCTTCGAAAACAAAAGAGACAAGAAAAGAAAGTAAAACGGAAAGAACATTATTTAAAAAAGAATATAGATTCTACTGAGCTTCACCGGTCCACCTCACCGGGAAAGTTTGTAAAAATAAGACCCGAAACATCTGAACCTGATAATGTGGTTAAGAAAAATAAAAAACCAAAAAATCCGCCTACTGTTCAAGAATTATTAAAATTGGAGCAAGAAAAAGAGAAGAGAGCCACTGATAAATTAAAATCGATGATGAATGAACAAAGAAGGAAGATGTTATTGGAAGCTAATGATGCGGAGGATAAGATTATTAAGAAGCTAGAGAAACAGCTCGGCTTGAACAAGACTCGAAACAAAAATAACTTTTTTGCTGACGATGGGCTGGATTATTTACTAGAGGTGTGTGATAGAACGACGTCAGAACAAATTGTGGCAGCGGAAAAACATTTGGCTGAAGTTGAAAGAGATTCCGACTTTGAAGATGATTTGGCTGCGGTTACAGGAAAAGAGCCACATCGTAAAAAGGAAAAGGAAAAAGAGATAACAGAAGACGGACATGATTCAGTTGATGATATGGATGAAGATGATGAGTTAGGAAGTGATGATGATATGCTGGGTGAAGACAGTGAAATGAGTGAAGATGGAACCGATTTTAAAGATAATGGTGAGTCTGATGATGATGGAGAAACTGGTGATGATGGAGAAGATGACAGTGAAGATGATGGTGATGAAGAACCAGAACGAAAAAAAAGATCAAATAACAAGAACAATAACAAGACAAAGGAAAAGATAATTGCAGAAGAAGATTTGAGTAAAATATTTAGTGACGATGAAGTGTCACATTTATCCGATGATGAAGAGTTGAGCGGAGGTGAAGAGAATTCAGAAATAGAACCTAAAGAGAAGCCGGATGTATGGGAGGACATTTATGGAAGGAAGAGAGATAAAGAAGGGAACATCATTAAGGAGGAAAAAGGCATTTACATCCCACCACATTTGAGGAACAAGGACTCAACATCTGAGAAGGAGATGGCACAACTGAAACGGCAAGTTAAAAGTGTTCTGAACAAGTTGGCTGGTACAAACCTTCACTGGGCTTGCACCAGTATAGAGAATCTGTACACCAGCAACAGCAGACACTCGATGAATACAGTATTGACATCGCTGTATATGGAGGGAGTCGTTGGAAGGTCTATGACTCCGGAGAGGATGCTGGCAGAACATGCGGCTATGATAGCTGTGTTGCATGCCAATGTTGGCTCCGAAATTGGAGCACATTTCTTGGAGGAGTTGTGTAAGAGGTTTGACGCAATGATGGACACACCACAACCAGTCGAGGACAAGACCTTAGACAATCTAGTTGCTTGCTTGGCACATTTGTTTTGTTTCAAGTTGTACCAATCTACGTTACTATTCGATATTCTGTCGCGTCTGACGCACACGTTGTCCGAGAAGTGCATAGACGTGTTGTTAGTGTGCGTGAGGTGCGCCGGCGCAGCTCTAAGGAAGGAAGCGCCTCTCGAACTCAAGACCTTCATACATGACACACAGGCTCGGAGTACCAAGATAGGGGCTGGTGTGACAGACGGGTCCCGTATAAAGTTTCTACTAGAAGTACTGCTGGCTATTAAGAATAATAATTTAAACAAAATACCCAACTACGATCCGAGCTACGTGGAACACCTGAAGAAGATGACTCGGAGTATTGTCAGGAAAGGAAATTATATAACACCCTTGAACATACGGCTGGAGGATCTATTGAAAGCCCAGGAACGCGGCAAGTGGTGGGTGGTCGGGTCGGCCTGGGAGGGGCAGGCTGAGGTCGGAGACAAACAGACAGAGAAACAGACAACGCACGCTGACCAGAGAATGATGGAACTGGCGAGGAAACAGAGAATGAATACCGATGTCAGGCGGAGCATCTTCTGTGTTATTATGTCTGCTGAGGACTACAT
GGACGCCTTTTCTAAGTTGGAACAGCTCTCCTTGAAGGGACAACAGCAGCGTGAGATATCGCATGTGTTGTTGTCTTGCTCTCTCCATGAGAAGGCTTACAACCCCTACTACAGTGTGCTGGCTGACAAACTGTGCAGTGTAGATAGGAAATATCAGCTATCAATACAGTACTCTGTGTGGGACAAAATAAAGGAAATTGAAACTCTATCCAAACAGTCAATGACCAACTTGGCACAGTTTCTCATCCATCTGTTTGTATCCAAAGCACTGCCGCTGTCCATTCTCAAGGAACGCGGCAAGTGGTGGGTGGTCGGGTCGGCCTGGGAGGGGCAGGCTGAGGTCGGAGACAAACAGACAGACAAACAGACAACGCACGCTGACCAGAGAATGATGGAACTGGCGAGGAAACAGAGAATGAATACCGATGTCAGGCGGAGCATCTTCTGTGTTATTATGTCTGCTGAGGACTACATGGACGCCTTTTCTAAGTTGGAACAGCTCTCCTTGAAGGGACAACAGCAGCGTGAGATATCGCATGTGTTGTTGTCTTGCTCTCTCCATGAGAAGGCCTACAACCCCTACTACAGTGTGCTGGCTGACAAACTGTGCAGTGTAGATAGGAAATATCAGCTATCAATACAGTACTCTGTGTGGGACAAAATAAAGGAAATTGAAACTCTATCCAAACAGTCAACGACCAACTTGGCACAGTTTCTCATCCATCTGTTTATATCCAAAGCACTGCCGCTGTCCATTCTCAAGATAATCCAATTCTCCGATTTAAACAAGAAGACCGTCCGGTTCATGAGACAGATACTCCTGGCTGTCATAATGAATGATAACTTGCAAGCGTCGCTGGAAGTGTTCCACAGGATAGCCAAACCCCCGAAGCTGCACATGTTTAGGGAGAGTCTGAGGTTATTCATTCAGCACTTCCTAATAAAGAACGCCGGCAAGCAGAGCGCGGTGTTGAGTGAAGAGGAGATGAGGACCTTGAGGGAACGGGCACAGGAAGTTGATAAGATTCTCACCATGCACGAAACTAAATTGAGATTTTGA

Protein sequence:

>DPOGS200999-PA
MKKPNNKPKFTNTRKVLRKQKRQEKKVKRKEHYLKKNIDSTELHRSTSPGKFVKIRPETSEPDNVVKKNKKPKNPPTVQELLKLEQEKEKRATDKLKSMMNEQRRKMLLEANDAEDKIIKKLEKQLGLNKTRNKNNFFADDGLDYLLEVCDRTTSEQIVAAEKHLAEVERDSDFEDDLAAVTGKEPHRKKEKEKEITEDGHDSVDDMDEDDELGSDDDMLGEDSEMSEDGTDFKDNGESDDDGETGDDGEDDSEDDGDEEPERKKRSNNKNNNKTKEKIIAEEDLSKIFSDDEVSHLSDDEELSGGEENSEIEPKEKPDVWEDIYGRKRDKEGNIIKEEKGIYIPPHLRNKDSTSEKEMAQLKRQVKSVLNKLAGTNLHWACTSIENLYTSNSRHSMNTVLTSLYMEGVVGRSMTPERMLAEHAAMIAVLHANVGSEIGAHFLEELCKRFDAMMDTPQPVEDKTLDNLVACLAHLFCFKLYQSTLLFDILSRLTHTLSEKCIDVLLVCVRCAGAALRKEAPLELKTFIHDTQARSTKIGAGVTDGSRIKFLLEVLLAIKNNNLNKIPNYDPSYVEHLKKMTRSIVRKGNYITPLNIRLEDLLKAQERGKWWVVGSAWEGQAEVGDKQTEKQTTHADQRMMELARKQRMNTDVRRSIFCVIMSAEDYMDAFSKLEQLSLKGQQQREISHVLLSCSLHEKAYNPYYSVLADKLCSVDRKYQLSIQYSVWDKIKEIETLSKQSMTNLAQFLIHLFVSKALPLSILKERGKWWVVGSAWEGQAEVGDKQTDKQTTHADQRMMELARKQRMNTDVRRSIFCVIMSAEDYMDAFSKLEQLSLKGQQQREISHVLLSCSLHEKAYNPYYSVLADKLCSVDRKYQLSIQYSVWDKIKEIETLSKQSTTNLAQFLIHLFISKALPLSILKIIQFSDLNKKTVRFMRQILLAVIMNDNLQASLEVFHRIAKPPKLHMFRESLRLFIQHFLIKNAGKQSAVLSEEEMRTLRERAQEVDKILTMHETKLRF-
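
The transcript and protein records above are consistent with each other: 3060 bp is 1020 codons, or 1019 amino acids plus one stop codon, which the protein record writes as a trailing `-`. The following is a minimal sketch of that check. It assumes the standard genetic code and that the transcript is a coding sequence read in frame from its first base. The function name `translate` and its `stop_symbol` parameter are illustrative, and the two short strings are the first ten and last six codons copied from the nucleotide record.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1.

    Stop codons are written as stop_symbol (the record above uses "-").
    Codons with unexpected characters become "X"; a trailing partial
    codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    residues = []
    for pos in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First ten and last six codons of DPOGS200999-TA, copied from the record.
first = translate("ATGAAAAAACCAAATAATAAGCCAAAATTT")
last = translate("ACTAAATTGAGATTTTGA")

# Length bookkeeping: 3060 bp -> 1020 codons -> 1019 residues + stop.
expected_aa = 3060 // 3 - 1
```

Here `first` and `last` match the start (`MKKPNNKPKF`) and end (`TKLRF-`) of the protein record, and `expected_aa` equals the listed 1019 aa.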