Monarch geneset OGS2.0

DPOGS213608
TranscriptDPOGS213608-TA3840 bp
ProteinDPOGS213608-PA1279 aa
Genomic positionDPSCF300033 + 799933-809279
RNAseq coverage203x (Rank: top 47%)
Annotation
HeliconiusHMEL0136792e-13545.18% 
BombyxBGIBMGA011671-TA8e-13650.83% 
DrosophilaCG7504-PB9e-5529.69% 
EBI UniRef50UniRef50_E2APV25e-9232.12%Helicase sen1 n=2 Tax=Formicidae RepID=E2APV2_CAMFO
NCBI RefSeqXP_001604330.19e-8931.13%PREDICTED: similar to splicing endonuclease positive effector sen1 [Nasonia vitripennis]
NCBI nr blastpgi|3071737912e-9132.12%Helicase sen1 [Camponotus floridanus]
NCBI nr blastxgi|3838559884e-9925.71%PREDICTED: uncharacterized protein LOC100875185 [Megachile rotundata]
Group
Gene OntologyGO:00055151.2e-20protein binding
KEGG pathway 
InterPro domain[15-89] IPR0089841.2e-20SMAD/FHA domain
[13-90] IPR0002531.2e-19Forkhead-associated (FHA) domain
Orthology groupMCL17021 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213608-TA
ATGTCTAAATGGAAGTTGTGCCGCCAAACTTTCGGAAATGATGAATTTGAATTAAAAGAAGGTGAAGTAACAATTGGCAGGGGTGTAAATAATACAATAAAACTTTCAAGTATAGTTATATCGAGGAACCATTGCTTAATTTGTGTCAAACAAGATGAAGTAGTTATCACCGATTTGAAGAGCTCCAATGGTGTGTATATAGGAGTAAAAAAAATAGCACCAGAAATCCCATATACTTTAAGTAATAATGACATTATAGGCCTTGGTTGGGCAATCGGAGCCCCTCTAGGAAATATAAAGGACAATGAAAAGCATATTTACAAACTAGTCAAAAATGATCCACCTGAAACGAATTTAAAACGTAAATTAGATATAAGGGAAACTATTCCCATAGTAAATATTATAGAATATGGGCAAAACAGCGAAATCAGTTCAAATGGTAGCAGTTCTGGTGAAATGATAAACAGAACGGAAATTACTAGTTTTGAAGCAAAAAAAGCAAAACTTGAACCAAATTTCATTGACCTAACTGACGATCATAATGAAGATTTTCCACAAGGTTTCATTAGAAATAAACAAGATTATTATATTATTAATGAGGAACCTGAAATCATACTTAGTGACAGTGATTACGACGCCGAGTATAGGTTTGTCAGATTATCAGAAAATTTACAAAAACCTTCTTTAAAAATGGAAAAACCAGACACTTCCATGGAAAATTCTACATACAGTCAACAAGATGATGATATCGAAGTGGACTCGGATGAAGTTGAAGAAGTAAATGAACCTGAACTATCAGATAAAACACCAAATAAAGCATTATTAAAGGACTATACAAACAAAGAATGTAACATATTGGACGAAATAAACGAAAATGTTAATGATGACATTGTTATTTACAAGAGACCATCCGAAAAGAGTAATTTTCTATCATTCCAACAAACTGATAATAATTTAAATACAGTGAAAAAATCACAAATACAAGAAATCAGTATATGTTGTGGAGACAAAAAAGAATGTCATGACACAGAAAATATTACACAAACAAGTTCAGAAGAAAAAAGAATCAATGGAAAATCATCAAAAATAATTCCTAATACTCAACTACCTGCTCCAATACTAAGTAAAACCAAAGCTAAAAAAAAATCAAAGGATCAAGTAAAACAGACAAAAACGAAAAAAAATAAATCAAACTCAATGAAAAAACCAAAACTAACAGATTCTCAAAAAGAGAAAAGGAGACAGAAACTTAAAGAAATATGTGAAAAAGAAATTTGTCCCGTGACACACTCTAATGTTGAATCAAAAGTACATGAAAATAGTAAAAATAGTATAAGTACAAACAAAACACAGAACCCTAAAGATATCCCTACGCTACAGGAGATTGAAAAAGTGAATAACAAAATAAAATCTTTTTCTCAGTCAAAAAAACAAATTCAAACTATCGAACCTCACGTAATGAAGCCTGGGAAAACAAGAACCGCTCGTAGTAAAGCTAAACAAGATACTGAACCCAAAGATAGCACAGATATAAATATGAGCAACGATCCTAAGACACCCGCAGTGGTACAGAATGAGGATGTCAACAATATTAAAATGGCCAAGCAAAGTCAAAGGCCCTTACTAGTTCATCCACATATATGCTTGTCACTAAGAAATAAAGTTCTCATAAAGATCCTCACATGGAACCCTAATTGGCTTGAGGAACAAAAGAAACAATCCCAACCAGCTCCTATATTAAAAAATGAAACAGCACTTTTGACGCTATTTTTATCGTTCAACAACTGTGAACAGTATGTCGGACAAATGAAAAATTTATTACTCATAGAAATATGGGAATATCTTACACAGGAGTACAATTATCCAAACAAAAGGCAATATTTAAATCTTCGAATTGAAACTTTACCTCCGAAGCCACCCAAGGAGAGGTATTCTGAGTTATTTACCATAACTGTTAGAATGCTAATGTCCACGAGCAGTATGAAACTGGTGCCCCGGCACGGTGATATAATGATATTGAAATTCAGGGAACACAATGAGAATAATTCGTATAAATTAAAATTTTGCTTGATTCACAAAGTCGATTATATATCATCTCACAATAACAAAACAATTGTTCTATCATGCCACGTCACTTACTCCAGTCTGTTTAAGAGCTTGCAACCGGGTGATGTGCTGACCGGGGAGAACATAACTAATATAAACAAAGAACTATCACTGTTTGAAGCGATGAAAGTGCTGGAGAGATCACCTATAAGAGAGTTCATATTGAAACCTGACCCGGCACAGTATATTGATATTGATAGGAATGTGTCCACGGGCACAGAGTCACAGTGGACGACTGCGTTGAACGACAGCCAGCGGCGAGCTGTGGCGGAGTCTGTCAGCGCCGCTCTTGGTTCTCAACCAGTATTGAGAATGATACAAGGACCACCGGGGACCGGAAAGTCTAGAGTGATATGCTCCATCGTAATGGCGTACTTTTATGGAAATTCAATGAGAAAACAAAGTAAGAGGGGCAAGATATTGATTTGTGCGACAAGCAACGCGGCCGTCGACGAGCTGGTTATCAGGTTACTGAACATGAGAGACACTCTAGAGGAAGGTGAACGTTTCCGTCTAGTCCGTGTTGGGCGGCTGGAGTCCATGCATAAAGACGTCAGAGACGTCAGCACGCAAGCAACGGCTCAGAGGGTACTCATGCGGAGAGAGGACTCCAACAGTGACGTCACTAAGGAGATTGCTCTTAATCAAGCCATGATCGATAGATGGAAGGCCGAGAAGGCGACCGACCCGGCCAGGGCCGCGTACTGCGACGACCGCGCGAGGTACTTCGCTAGACAAATAGAACTGCTTCGCGGCGGCGGCGTGCGTCCGGGCCAGCTCGTGGATGTGGAGCGTCGGTTGGTGGAAGAAGCCGACATCGTGGCCACAACCCTCGCCAGCTCAGTCAACCACAAGATGCGGGGGTTGAGAGGTATCGAACTTTGCATAGTGGAGGAAGCCGGTCAGGCCATCGAACCCGAGACCTTGCTGCCTCTGATGCTGGGAGTCAACAAGATGACGCTGGTGGGAGACCCGCAACAACTGCCGGGGTACATCTGTTCGGAGCGAGCCAAGACGCACGGTCTGGACAGGAGTCTGTTCTCCCGCTTGGCCGCGTATAGTGAGTGCTGGGAACGGCCACCGTTGGTGCTACTGGATCGCCAGTACAGGATGCACCCCACCATCGCCGACTACCCCAACAGAGCCTTCTATGGGGGGAGGGTGCAGTCCGTGCCCCCACCACCACTCTCACTACACCTACCACCCTACTGCATACTCGACATACCGGGGAGTGAACATGACGAGGCGTGGGGCGCGGCCCGGGTGGCGCTGGCGGTGTTCTCAGCGGCCCGAGCCCACAACCCTCAGCTGTCCGTGGCCGTCATCACACCGTACGTCGCTCACAGAGACTTACTGAGGAAGTACCTCCTGGAACTCGACGAGTCTGCTCGTGGCGTGGAAGTCAACACGGTGGACAGCTTCCAGGGTCAGGAGCGGGACCTGGTGGTGGTGTCGCTGGGGCGGCGGCAGGGCGTCGGCTTCCTGGCACATGCTGGACGGATGAACGTCCTCCTCACGCGAGCCAGACACGTCCTCATACTCTGCCTATACAAAAACGCTGTCGAGAAACACGATCAATGGCGTACTTTAGTCAGAGATGCCGAAGGAAAGAGCTTGTTCAAGACCCTCCCCAGCTACATGTGCAGACCCAGCGGCCACTCTGCCAGGCAGGCTTCCTCCAAGGAAGTGTTAGAATTCTTAAGATCTAAAAAGAATAAGCACGGACATAAATGA

Protein sequence:

>DPOGS213608-PA
MSKWKLCRQTFGNDEFELKEGEVTIGRGVNNTIKLSSIVISRNHCLICVKQDEVVITDLKSSNGVYIGVKKIAPEIPYTLSNNDIIGLGWAIGAPLGNIKDNEKHIYKLVKNDPPETNLKRKLDIRETIPIVNIIEYGQNSEISSNGSSSGEMINRTEITSFEAKKAKLEPNFIDLTDDHNEDFPQGFIRNKQDYYIINEEPEIILSDSDYDAEYRFVRLSENLQKPSLKMEKPDTSMENSTYSQQDDDIEVDSDEVEEVNEPELSDKTPNKALLKDYTNKECNILDEINENVNDDIVIYKRPSEKSNFLSFQQTDNNLNTVKKSQIQEISICCGDKKECHDTENITQTSSEEKRINGKSSKIIPNTQLPAPILSKTKAKKKSKDQVKQTKTKKNKSNSMKKPKLTDSQKEKRRQKLKEICEKEICPVTHSNVESKVHENSKNSISTNKTQNPKDIPTLQEIEKVNNKIKSFSQSKKQIQTIEPHVMKPGKTRTARSKAKQDTEPKDSTDINMSNDPKTPAVVQNEDVNNIKMAKQSQRPLLVHPHICLSLRNKVLIKILTWNPNWLEEQKKQSQPAPILKNETALLTLFLSFNNCEQYVGQMKNLLLIEIWEYLTQEYNYPNKRQYLNLRIETLPPKPPKERYSELFTITVRMLMSTSSMKLVPRHGDIMILKFREHNENNSYKLKFCLIHKVDYISSHNNKTIVLSCHVTYSSLFKSLQPGDVLTGENITNINKELSLFEAMKVLERSPIREFILKPDPAQYIDIDRNVSTGTESQWTTALNDSQRRAVAESVSAALGSQPVLRMIQGPPGTGKSRVICSIVMAYFYGNSMRKQSKRGKILICATSNAAVDELVIRLLNMRDTLEEGERFRLVRVGRLESMHKDVRDVSTQATAQRVLMRREDSNSDVTKEIALNQAMIDRWKAEKATDPARAAYCDDRARYFARQIELLRGGGVRPGQLVDVERRLVEEADIVATTLASSVNHKMRGLRGIELCIVEEAGQAIEPETLLPLMLGVNKMTLVGDPQQLPGYICSERAKTHGLDRSLFSRLAAYSECWERPPLVLLDRQYRMHPTIADYPNRAFYGGRVQSVPPPPLSLHLPPYCILDIPGSEHDEAWGAARVALAVFSAARAHNPQLSVAVITPYVAHRDLLRKYLLELDESARGVEVNTVDSFQGQERDLVVVSLGRRQGVGFLAHAGRMNVLLTRARHVLILCLYKNAVEKHDQWRTLVRDAEGKSLFKTLPSYMCRPSGHSARQASSKEVLEFLRSKKNKHGHK-