Monarch geneset OGS2.0

DPOGS210715
Transcript: DPOGS210715-TA  7695 bp
Protein: DPOGS210715-PA  2564 aa
Genomic position: DPSCF300013 - 248721-273015
RNAseq coverage: 1832x (Rank: top 7%)
Annotation
Heliconius: HMEL004668  0.0  73.35%
Bombyx: BGIBMGA006328-TA  0.0  61.84%
Drosophila: mask-PC  0.0  78.57%
EBI UniRef50: UniRef50_F4WSY2  0.0  80.03%  Ankyrin repeat and KH domain-containing protein 1 n=8 Tax=root RepID=F4WSY2_ACREC
NCBI RefSeq: XP_393472.3  0.0  75.90%  PREDICTED: similar to ankyrin repeat domain protein 17 isoform a [Apis mellifera]
NCBI nr blastp: gi|307186886  0.0  79.79%  Ankyrin repeat domain-containing protein 17 [Camponotus floridanus]
NCBI nr blastx: gi|242019742  0.0  40.77%  multiple ankyrin repeats single kh domain protein, putative [Pediculus humanus corporis]
Group
Gene Ontology: GO:0003723  6.8e-13  RNA binding
               GO:0005515  6.2e-08  protein binding
KEGG pathway:
InterPro domain: [197-523]  IPR020683  5.6e-75  Ankyrin repeat-containing domain
                 [1908-1978]  IPR004087  6.8e-13  K Homology
                 [1913-1973]  IPR018111  1.4e-11  K Homology, type 1, subgroup
                 [295-324]  IPR002110  6.2e-08  Ankyrin repeat
Orthology group: MCL10376  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS210715-TA
ATGCAGAATGTCGGACAGTCTGAAAATCGAAATATCGATAAGGTGAATGTGCAACTTGATGTGGTAAAATCATCCACTCCTCAGAGTTCGGCTTCGAGTCCGGCCAAATCTGAAACCGAAACGTATTCGGGAGCGCCTGCCAAATACATGACAGACTCTTCGGAAAGCGAGGATGACTCTGTATCTGAGATCTTGCTGGCTTGTTTGTGCCTTAGCAGGCCAGACCTGTTGGCGGAGATGGAGGAAGAAGGTGGCGGTACAGAGTCGTCTAAGTTCCTTCTGTCTCATGATGACCCCGAAAGGGCCGTTGACCCTGAGATGCAGGCTCGCCTCGAAACACTTTTAGAAGTTGCTGGCATAGGCAAGTTGTCCGGTGAGGGCAAACACCTAGCGGACCCCGAGGTGCTCCGTCGACTGACGTCTTCGGTGTCGTGTGCGCTGGACGAGGCGGCCGCGGCCCTGACGCGCATGCGCAGCGACCAGCCCCCCGCCCACCATCGACACCACCATCAAGATAAACCTACTCAGTGTGGATCCACGGCCACGGGGACTACACCGACGGCCGCGGCGTCCGTGGGCGCGGACGGGGCTCCGTCGCTGGCGGAGGCCTGCTCGGACGGTGATGTGGGCACGGTGAGGAAGCTGCTGACCGAGGGCCGGTCGGTTCACGAGACCACTGAGGAGGGCGAATCCCTACTCTCGCTGGCCTGTTCAGCTGGTTACTACGAGCTGGCTCAAGTGCTACTGGCGATGCACGCGAGCGTCGAGGACAGGGGGATCAAGGGCGATTGTACGCCGCTGATGGAGGCCGCGAGTGCCGGTCACGTGGACATCGTGAGGCTGCTGGTCGCTCACGGCGCCGACGTCAACGCTGTCTCGGGCTCCGGGAACACGCCCCTCATGTACGCCTGCGCCGGCGGACACGAGGACTGCGTGCGGGCGCTGCTCGATAACGGGGCCAATGTAGAAGATCACAACGAAAACGGTCACACGCCGCTCATGGAGGCCGCATCAGCCGGTCACGTGGGCGTCGCGAAGATCTTGCTGGAGCACGGCGCCGGCATCAACACGCACTCCAACGAGTTCAAGGAGTCCGCCCTCACGCTCGCATGCTACAAGGGTCACCTGGACATGGTCAGGTTCCTGTTGGCGGCCGGCGCCGACCGCGAGCACAAGACTGACGAGATGCACACCGCCCTCATGGAGGCCAGCATGGACGGACACGTCGAGGTCGCCCGGCTGCTGTTGGACTCTGGAGCACAGGTTAACATGCCGACGGACAGTTTCGAGTCTCCGCTGACCCTGGCGGCGTGCGGGGGACACGTGGAGCTGGCTATGTTGCTGTTGGAGAGAGGCGCCAACATAGAAGAAGTCAACGACGAGGGATACACGCCGCTCATGGAGGCAGCTAGGGAAGGTCACGAGGAGATGGTGGCGCTGCTGCTCGGTCAGGGCGCGTCCATCAACGCTCAGACCGACGAGACGCAGGAGACGGCCCTCACCCTGGCCTGCTGCGGCGGCTTCCTCGAGGTGGCGGACTTCCTCATCAAGGCGGGGGCGGATGCGGAACTGGGAGCTTCCACGCCGCTCATGGAGGCCTCGCAGGAAGGACACCTGGAGCTCGTACGATACCTGCTGCAAGCCGGCGCGGAGGTCCACGCTCAGACGCAGACGGGCGACACGGCGTTGACGTACGCGTGCGAGAACGGACACACGGACGTGGCGGACGTGCTGCTGCGGGCCGGGGCGCTGCTGGAGCACGAGAGCGAGGGAGGCAGGACGCCGCTCATGAAGGCCTGTCGCGCCGGACATCTCTGTACCGTGCAGTTCCTCGTGGGCAAGGGTGCTGACGTGAACCGCATGACGGCCAACGGGGATCACACGCCGCTGTCGCTGGCGTGCGCCGGCGGACATGCGGACGTGGTGAAGTTCCTGCTGGCGTGCGACGCCGACCCCTTCCGCAAGCTCAAGGACAACTCTAGCACACTCATCGAGGCGGC
CAAGGGCGGACACACCACCGTCGTGCAGCTGCTGCTAGACTACCCCCACTCCCTCATGTTGCCCAGAGGTAACACGGGTACGGAGGAGAGCGGGGGTCTGAGTTCCGCACAGGCGGCGGCGCTCGGCCTGAGTCACGCCCCGGCGCCGGGCGCGCCCAGCCAGCGAGCGCTGCTCCCCGCGCACGCACCCCCCTCGCACCCTCACGCACATGCCCACCCTCACGCGCACCCCCCGCCGCACGCGCATCCCTCGCACGCCGCTCACCCCGCGCATCCTGCTCACCCCGCCCTCCCCGCGGCCGCGCCGCAGCAGGACGTGCCTCCCAACTTCGCCAAAGTCTATTTGGACGGAAGAAAGAAACAGGCGAGCGGCAACGGCACGGTCCAGCCGGGCGTCCCCGCGCACCCCCCGGCCGGGGCGGCGGGCGCGGGAGGGGCCGGCAAGCACAAGTGCGGCCGCAAGCAGCGTCCCGCCGCGCCGCACTCCGACCACCACCTGCCGCCGCCGCCCGACATACTGGAGGACCATAGGGCGGAGGCGCTCGCCGGTCAGCCGAGGAATGAATCACTGCCACCGACTGAGAGGACGATGCTGGAACTAGCCGACGCCTCCGCACCCCCTGTGGCGGCGCCGGCGACGCCCCCGTACCCACCTCCTCAACAGCTGTTCCCGGTGCAACAACTCTCCACCAACCTCAACCAGAGCACTTCAATACAAGATCGTCCGCGCGTTAAGGCGACTCGCAAGTGCGCTAAGTACGGCTGCAAGTTCTTTCTGCTAGAAGCCCTCAAACGACTCGGTGGACGTATTAGTGACGTCGCGCAACAGTTGCAAGAGCTTCAGCAGCAACAACTCCAGCAGCTAGCTGTACAACAGCAACAGCAGCAAAAGAAGCAACAACATCAGCAACAGCAACAACAGCAGCAGCAGCAGCAGCAACAACAACAGTATGCACAGGAGATGCAGCAGAGGCCTGAAGCATACTCTCCGGAGGAGGGTGGGTCGGTGGAGGCCCGCGAGCTGGGCGCCGTCCTGGACTACCTGCAGCGGGAGGTGCCGTCCCTGGTGGCTCTGCCGCCCAACGAACTGCGCTCACTCGTCCTGCAGGTGATGCAGCAGAAGTCCCACGAGATCCTGTCGGGGAAATGTGGGTCAGAGGGCACGGAGGGTGAGGGGGATGGGGAAGGGGAAGGCGAGGGAGAGGCGGAGGGCGAAGGAGAGGACGCCGAGCGTGACGAGAGACACGCCAGGAGGCTGCTGGCCGCCGCCGAGGAGGCGCTGTCCAGCGACTGGCCCGCTGTACTCGTCGATGTCACGGGTGCTCCGGCCTGCCACTACCGTCCTCGGCCTCCGTCCCCTTCGCTGGAGTCGTGCTCGCTGGGCCTGACCCCGGCCCCGGCGCCCGCGCCTCTAGACAACCAGCCACACTTTGCTCTGCCACCTCCAACACTGCCTTACAACGACTATAGATCATCAACGTTCGGTCCCGCGGCGGGCGGAGCGTCGAGCGGAGTGGCGAGCGGAGTGGTGGGCGGCGTAGTGGCGGGCGTGGCGGCTGGCGTGGCGGGTATCAATGCGATCGGTCCCGCCGGCTCATACACGCCCGCGGGGACTCCGCCGCACACGCAAACCCATTCCAAGAGAGAACAACACCACCATGCCAACAACGCCGCGCTCAAGAAAAAGGGTCGGTTCGCGGGCAGTACCCGGAGTCGCGCGGAGTCGTCTCAGTCGCAGGCCGCGCCACCCCAGCCCGCGCCGGCCGCCTACTCCGCCATGGATGTGGACGGCGAAACGGACTCCAACCACGACACGGCCCTCACCCTAGCCTGTACCGGCGGACACGAGGACCTGGTGGAACTACTGCTGTCCAGGGGCGCGGACATCGAACACCGGGATAAGAAGGGCTTCACACCTCTTATCCTGGCTGCCACGGCCGGCCACGAGAAGATAGTGGAGATCCTGTTGAACCACGGCGCGGATATCGAAGCTCAATCGG
AGAGGACGAAGGACACCCCGCTGTCTCTGGCCTGCAGCGGCGGACGTTATGAAGTAGTGGAGCTGATCCTGAGCCGAGGAGCCAACAAAGAACACCGCAACGTGTCCGACTACACCCCGCTCTCGCTCGCCGCCTCCGGGGGATACGTCAACATCATACGACTCCTATTACACCACCAGGCGGAGATAAACTCCCGCACGGGTTCCAAGCTGGGTATATCGCCTCTGATGCTGGCGGCCATGAACGGTCACACGGCCGCTGTGAGACTGTTACTGGACTGCGGCTCCGACATCAACGCTCAGATAGAAACCAACAGGAACACGGCGCTAACACTCGCCTGCTTCCAAGGGCGTCACGAGGTGGTGAGTCTGCTATTGGATCGGAAGGCGAACGTAGAGCATCGCGCCAAGACCGGCCTGACGCCTCTCATGGAGGCGGCCAGCGGAGGATACGTGGAGGTCGGCCGAGTCTTACTGGACAAGGGCGCTGATGTGAACGCGCCCCCCGTCCCATCCTCGAGAGACACCGCGCTAACCATCGCCGCTGATAAGGGACACACCAAATTCGTCGAACTTCTACTGCAAAGACGAGCGGCTGTAGAGGTGAAAAACAAAAAAGGCAACTCCCCGCTGTGGCTGGCGGCTAACGGCGGACACCTGGCTGTAGTGGAGATGTTGTACGCGGCCGGAGCGGACATTGACTCTCAAGACAATAGAAAGGTATCCTGCTTAATGGCCGCTTTCCGAAAGGGACATACTAAAGTGGTCAAATGGATGGTCGGTGTCGTAACGCAATTCCCCTCCGACCAGGAAATGACCAGGTATATCTGCACGATATCGGATAAGGAACTGCTGGAGAAATGCCAGGAGTGCGTGCGGGTCATACGAGCGGCTAAAGAGACGCAGGCCGCGCGCGCCAACCAGAACGCCACCATACTGCTGGAGGAGCTGGACGCTGAGCGCTGCAGGGAGGAGTCCAGGAGGCAGGCGGCGGCGCGGCGTAGGGAGAGAAAGAAGAAGAAGAAGATGGAGAAGAAGGAGGAAAGGCGCAAGTTGCAGACGGAGAACGAAAAGAACACCCTGTACTGCGAGAAGGCGTTGGGAGAGTGTTCGGAGGGAGGCGAACCCGACGACGAGCCCGCGGCCAGGGAGGAGGGCGACTCCGGCATAGACGCCAACTCACAGGGCTCGTGCTCCTCCTCGGACGTGAAGGCGCCTCCAGCTCAAAGTGCCAAGAGCAAGAAAAAGAAAAAGGAAGAAAAGCCAGCGCCCGCCCCGACACAGCCGCCGCCCAAGAAAATACCAGACAAAGTGAAACTGAAAATAGACACAAAACCCGAGAAAGAGGTCCCGGTGAAGGCGGACAAGAAGCTGGAGAAAGAAAACGTGGCGCCCACCTCGCCGCCCGCCACGCCCGCCAAGCCCGCCGCGGACAGGAGGCCAGACAAGAAGGACAAGAAACCCGAAGAGGACGCCAAAAACATCACAGTACAGAACGTCAAGTATGGAAATAACTCGCGGAAGAGTCAAGTGTTCGAGTCCAGCAGACTCAACGTGGACAAGGACGACGACGCCGGCGACAAAAATAAAAAGAGTCACGCGGCGCTGCAATGGGAGGGCGATAAGAGCACGTCTCCTAAGGCAGCCAGCGCCAGTGTGCGGCGAGACGAGGGCTGGAAGGAGGTGGTACGCAAGTCCTCCGTCCAGACGCTCTCCACACTGGAGCCGGGCTGCAAGAAAGTATCAGTGGGCGCTCACGCTATATCCCGTGTGATCGGACGAGCTGGGACTAACATCAATGCCATACGGTCCGCTACTGGAGCTCACATTGAAGTAGACAAGCAGACCAAGGGCCAGGGGGAGAGAATCATCACCATTAAAGGGTCATCGGAGGCGACGAAGCAGGCTGCGAGTCTTATAGCGGCCATGATCAGAGATCCTGAGGCCGACATCTCGGCTCTGTTACCGCGGGCCAAGCTCCCGCCGCCGCAACCTGCGCCC
GCGCATCAACCGAAGCCGAAACAGACCCCTGTTAAGATGCCTATGACAGTGAGTTCGATTGTGGGTGGTTCACGAGCGACTCCGCCCAGCCGGTCCAAGCCCCACACCAGCACCAGGCCGCCCATGCCCAGACTGCATGCTCATGCTATACAGCTGCCAGAAAAACGTGTTTCGAGCGCTCCAGCCGTCACCACAACCACGTGTACGACGGTGACCTCTATCAAGACCGGCGCGCTGTCCTACACGGGCGCTATCGTCGGCGCCAGAAGTCACACGTTCGCAGCCAAGTTGACCGCAACGCCACCCGCAGACAGCAAGCCGCGACCCACGCCTCAGGTGGTTCCGAGCAGTCCGGCGGCCAGCAGCAGCAGCACCGTCGTCTCCTCGCCGCTGAAGACGCGCGACACTCGCGAGCCTCCGCCGCGCGAGGCCCGCGAGGTCCGAGAGGTCCGCGAGGTCCAGGCCCGAGAGGTGGTGAGGGAAGCGACACCAGAGACACACAGACTCGATGACGAGCCGAGGATCCCACGCCCGCACGCTGATGCGCTACAATTGAGTCCCGATAACTCGAGCACATGGAGCAATGAAGACATCCCAGTCAATACATCCGCTGCCCTCCATATTAATACCACCCCACAGGTGGCGGGCGGTGTGGGGGTCGTGTCGGGGTCAGGCGGAGCCCAGGAGTATTCCCTGTTCAAGGACCTCTCGGGAGGATCCGTGGCCATGTGGGCCGATCACAACGTTGACCTACCTCCGCCGCAGGCGGACGCCAGTAAGGCCCCCGGGTACCGCGGCGGCGGCGGCTGTTCTCCGTGTTCTCGCACGTCCTCGCACGGCTCCACTCCCCCTCCGCCGCCGCCGCCCCCCTACCACCATCCCATGCCTATCGGCAACGCGGTCAATGCCATGGACATGAGCGGACTGTCAAGAAACGGGCCTATCTACCAGGACAACTCACGCAATGGACACAACATGATGGCAAGTGTGGGTATGTCGAGCGGCGTGTCGCTGTCTGGTCTCGGCTACGTGGGCGTAGAGAGCGTGTCGCGTCTCAACCCGCGCGCTCCTGACTTCGCACAGAGACATCCTCTGCTGCAGCACCAGCAGCACAAACATGCCGCACAGCAACTGTTTTCTGGAGCCGGCGGCACTAGCGGCGGGAACCTGAGCTCGCTGCTTATGTCGTATCAGCAGGGAGCGCCCAAGATGCAGCACGCGCCGCCTCACCACCACCATCCATACCAGTCTCTACTGGACCGCGGCGTGGGCGTGAACTCGGTGGGCAACGTGAGCGGCGTGGGCGTCGGCGTGGGGTGGGGCGAGGAGGAGGAGAGGAAGCCGCGTCCCATCGGCACGGAGCGAGCGTGGAAGATGACCGCGCCCGACGACTGGCACCACCATCACCAGCACCACCGCACAGACCACGACAGATACCAGCAAGGAGTAAACATGGGCGGCGTGGGCAGTGTGGGCGGCGAGGGTGGTTACGGCGCGGGCGTGGGCGGTGGCGGCGCAGCCACGGCCGCCGCTCTGTCTCTGATGCACGCCCTGCCGCTGTCGGCCTGCCTGCCGGCCTACCTGCCGCCCGGCGGCCTGCCGGACCACCATCACTGGGACCAGCCGCCGCACCACGCCACTGATAAACAGGTACCACACTACGCTGGTACTCCAACCATCAAGAGAAGTGTCTGA

Protein sequence:

>DPOGS210715-PA
MQNVGQSENRNIDKVNVQLDVVKSSTPQSSASSPAKSETETYSGAPAKYMTDSSESEDDSVSEILLACLCLSRPDLLAEMEEEGGGTESSKFLLSHDDPERAVDPEMQARLETLLEVAGIGKLSGEGKHLADPEVLRRLTSSVSCALDEAAAALTRMRSDQPPAHHRHHHQDKPTQCGSTATGTTPTAAASVGADGAPSLAEACSDGDVGTVRKLLTEGRSVHETTEEGESLLSLACSAGYYELAQVLLAMHASVEDRGIKGDCTPLMEAASAGHVDIVRLLVAHGADVNAVSGSGNTPLMYACAGGHEDCVRALLDNGANVEDHNENGHTPLMEAASAGHVGVAKILLEHGAGINTHSNEFKESALTLACYKGHLDMVRFLLAAGADREHKTDEMHTALMEASMDGHVEVARLLLDSGAQVNMPTDSFESPLTLAACGGHVELAMLLLERGANIEEVNDEGYTPLMEAAREGHEEMVALLLGQGASINAQTDETQETALTLACCGGFLEVADFLIKAGADAELGASTPLMEASQEGHLELVRYLLQAGAEVHAQTQTGDTALTYACENGHTDVADVLLRAGALLEHESEGGRTPLMKACRAGHLCTVQFLVGKGADVNRMTANGDHTPLSLACAGGHADVVKFLLACDADPFRKLKDNSSTLIEAAKGGHTTVVQLLLDYPHSLMLPRGNTGTEESGGLSSAQAAALGLSHAPAPGAPSQRALLPAHAPPSHPHAHAHPHAHPPPHAHPSHAAHPAHPAHPALPAAAPQQDVPPNFAKVYLDGRKKQASGNGTVQPGVPAHPPAGAAGAGGAGKHKCGRKQRPAAPHSDHHLPPPPDILEDHRAEALAGQPRNESLPPTERTMLELADASAPPVAAPATPPYPPPQQLFPVQQLSTNLNQSTSIQDRPRVKATRKCAKYGCKFFLLEALKRLGGRISDVAQQLQELQQQQLQQLAVQQQQQQKKQQHQQQQQQQQQQQQQQQYAQEMQQRPEAYSPEEGGSVEARELGAVLDYLQREVPSLVALPPNELRSLVLQVMQQKSHEILSGKCGSEGTEGEGDGEGEGEGEAEGEGEDAERDERHARRLLAAAEEALSSDWPAVLVDVTGAPACHYRPRPPSPSLESCSLGLTPAPAPAPLDNQPHFALPPPTLPYNDYRSSTFGPAAGGASSGVASGVVGGVVAGVAAGVAGINAIGPAGSYTPAGTPPHTQTHSKREQHHHANNAALKKKGRFAGSTRSRAESSQSQAAPPQPAPAAYSAMDVDGETDSNHDTALTLACTGGHEDLVELLLSRGADIEHRDKKGFTPLILAATAGHEKIVEILLNHGADIEAQSERTKDTPLSLACSGGRYEVVELILSRGANKEHRNVSDYTPLSLAASGGYVNIIRLLLHHQAEINSRTGSKLGISPLMLAAMNGHTAAVRLLLDCGSDINAQIETNRNTALTLACFQGRHEVVSLLLDRKANVEHRAKTGLTPLMEAASGGYVEVGRVLLDKGADVNAPPVPSSRDTALTIAADKGHTKFVELLLQRRAAVEVKNKKGNSPLWLAANGGHLAVVEMLYAAGADIDSQDNRKVSCLMAAFRKGHTKVVKWMVGVVTQFPSDQEMTRYICTISDKELLEKCQECVRVIRAAKETQAARANQNATILLEELDAERCREESRRQAAARRRERKKKKKMEKKEERRKLQTENEKNTLYCEKALGECSEGGEPDDEPAAREEGDSGIDANSQGSCSSSDVKAPPAQSAKSKKKKKEEKPAPAPTQPPPKKIPDKVKLKIDTKPEKEVPVKADKKLEKENVAPTSPPATPAKPAADRRPDKKDKKPEEDAKNITVQNVKYGNNSRKSQVFESSRLNVDKDDDAGDKNKKSHAALQWEGDKSTSPKAASASVRRDEGWKEVVRKSSVQTLSTLEPGCKKVSVGAHAISRVIGRAGTNINAIRSATGAHIEVDKQTKGQGERIITIKGSSEATKQAASLIAAMIRDPEADISALLPRAKLPPPQPAP
AHQPKPKQTPVKMPMTVSSIVGGSRATPPSRSKPHTSTRPPMPRLHAHAIQLPEKRVSSAPAVTTTTCTTVTSIKTGALSYTGAIVGARSHTFAAKLTATPPADSKPRPTPQVVPSSPAASSSSTVVSSPLKTRDTREPPPREAREVREVREVQAREVVREATPETHRLDDEPRIPRPHADALQLSPDNSSTWSNEDIPVNTSAALHINTTPQVAGGVGVVSGSGGAQEYSLFKDLSGGSVAMWADHNVDLPPPQADASKAPGYRGGGGCSPCSRTSSHGSTPPPPPPPPYHHPMPIGNAVNAMDMSGLSRNGPIYQDNSRNGHNMMASVGMSSGVSLSGLGYVGVESVSRLNPRAPDFAQRHPLLQHQQHKHAAQQLFSGAGGTSGGNLSSLLMSYQQGAPKMQHAPPHHHHPYQSLLDRGVGVNSVGNVSGVGVGVGWGEEEERKPRPIGTERAWKMTAPDDWHHHHQHHRTDHDRYQQGVNMGGVGSVGGEGGYGAGVGGGGAATAAALSLMHALPLSACLPAYLPPGGLPDHHHWDQPPHHATDKQVPHYAGTPTIKRSV-
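
Length check:

The transcript length (7695 bp) and protein length (2564 aa) given in this record fit a complete coding sequence: three bases per residue plus one stop codon. The following is a minimal Python sketch of that check, not part of the original record. The helper names `read_fasta` and `lengths_consistent` and the toy sequences are hypothetical, and it assumes that the trailing "-" in the protein sequence marks the stop codon.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def lengths_consistent(cds_len, protein_len):
    """Return True if cds_len is three bases per residue plus one stop codon."""
    return cds_len == 3 * (protein_len + 1)


# Toy records (hypothetical), wrapped across lines like the sequences in this record.
example = ">toy-TA\nATGAAA\nTGA\n>toy-PA\nMK-\n"
seqs = read_fasta(example)
cds = seqs["toy-TA"]
# Assumption: a trailing "-" marks the stop codon and is not a residue.
protein = seqs["toy-PA"].rstrip("-")

print(lengths_consistent(len(cds), len(protein)))
# Lengths as listed in this record for DPOGS210715-TA / DPOGS210715-PA.
print(lengths_consistent(7695, 2564))
```

To run the same check on the real sequences, pass the nucleotide and protein FASTA text of this record to `read_fasta` in place of the toy string.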