Monarch geneset OGS2.0

DPOGS212022
TranscriptDPOGS212022-TA4020 bp
ProteinDPOGS212022-PA1339 aa
Genomic positionDPSCF300054 - 681393-704228
RNAseq coverage946x (Rank: top 14%)
Annotation
HeliconiusHMEL0136060.069.33% 
BombyxBGIBMGA010176-TA0.073.62% 
Drosophilaklar-PA8e-3430.54% 
EBI UniRef50UniRef50_D7EIJ72e-3934.80%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EIJ7_TRICA
NCBI RefSeqXP_001122432.16e-5229.83%PREDICTED: similar to klarsicht CG17046-PA [Apis mellifera]
NCBI nr blastpgi|3504243088e-6230.60%PREDICTED: hypothetical protein LOC100747581 [Bombus impatiens]
NCBI nr blastxgi|3838515486e-7630.65%PREDICTED: uncharacterized protein LOC100880783 [Megachile rotundata]
Group
Gene OntologyGO:00037794.8e-16actin binding
GO:00160214.8e-16integral to membrane
KEGG pathway 
InterPro domain[1284-1339] IPR0123154.8e-16Klarsicht/ANC-1/syne-1 homology
Orthology groupMCL25707 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212022-TA
ATGGGACGCCCCGATGGGAACGAGCCTATCACACTTCAATCAAACGCCGATAAAGATGTATTACAAAGGGACGTGGAATGCCACCGTCGTATTGTCAGCTCGGTGGTGAGGCTCTGCGGTGGAGCGGACGTGGCTCGAGCTCTAGAACGTCGCTGGCATCTGCTGTATCTGCGGGCCATCGAGTGGCTCTGTCACCTGGAGGCCTGCATCGCTAAGAGTGATAATCAGAACTGCGCTTCAATAGAAGTAGCGAGTGATAGTGACGACGAGCCGGCTCTGAAGCAACCGCGGCTTACAAGACGTGGCTCACCAAGACGACAGAGGACACCTATTAATGTTATACGGAGGAGCAGCTCTGAAGATAGCCGGCAGTCTGCTTCCGAGGAGGAACAGGAGTATGCAGTGACTTACACCTGGCGCGGCTTCGGATCGGACTGTGAGACCGAGCTGGTGAGGGACAGAGACTGCGAAATGGACGAGAGACGTGTTCCGACCGTGGACGAGGGAGAAGACAGACAGACGGATCAAACAGTTCCCGGCGGAGATAATGTTGATGGTATCAGTGTACGCGAAACGGGGAGATTAGAAGATATACCAAGAGAAATGAACACTCCACCTACAGTTGTCAAGAGAAAAAAACCAGTGAACACATCGAAATTCAATCAATCTGATAGAAAATCTAAAAATTGTGCCACCTTCTACTTCAGACATCACGACACGGATTCTGATCGACAGGTCGTTGAGACTGACGAGAAGTCGCAAGAGGAATCGTCGGAAGAGGAATGGACTTACGTGGACGGTCCCAAAATAGCGAACGAAGATTCTACAGACTTAATGACGTCATCCATTGAAATACAATGTGACTTACCAGTGGATGATAAGAAAGAAAAACTTCTATCCCCGAAGAAATTGGAGTCCACAACGCCAGATCTTATAAGAGTCGATCAGAATTCAAGATGTAAAGATTTAGAACGATTGGTCCTACAGGCTGAGGAATTGGTTCAGAAACAGGCACAGCAACAGATGGCTAAGAAGAAGAATACACGAAACTTGAAACCATTATCATTTGACGAGGAAGGCAAGTCCAGCAGGGAGAAAATGTCAAGAATCAAGGAATGGTTGAATCAAAGTCCAGAAGACAAAAACGATAGCAATCAGTGCACTGAAAGCTACGATGCGTCCGGGGAATACACTACAGAGAGCGAGGTCGATACATCACTGACGTCAGAAGAAAGAAACATTCATTCATCAATGGACATGAGCACATCAACTTGTACTGTGACCCCGACGCATCACGCCAAGGTGACGCTGCGCAAGAAGAGGAATGCCACGCGGCCGTGGTCTGTGTCGTGTCTGTCTCAGCTCAGTGCTGGCATTATAGCCACGCCCACTGATGACGTCATTAACATGTCCATATCAGAGTCCGCACTGAACACACTGGCATCGCCTAGAAGAGTCACACCTGGGAATTCCAGCTCTAAATTAAGAGGAAGCTCAACTACTGTACAGGGTCACATATCAAGCACAAATACTATGACAGAGGCTTGTACGTCGTGTGTGGAGGCTAACGATAAACAATGTTGGTTGAGAAGGAAAAGGTTGAAACTGAAACGACAGAACAACACTAGAAGAGAGAGATTGGTTAAGAGTCTGTCGTTTTGTGGAAGGTTGAGTCCAGAAATTGAAGACAAAAACGAAAGAAGTGCGGCCAGTGACCCCGCCACTAGTAGGAAGAATAGATTCGATACAACATCAACTTCGGACAATGACAGCGACGCGGATCTTATCAAACAGCAGTTGGCAACCATAGCAAATCTGAGGAAAAGCATCGAGAAAACACACATCTCAAGCAAAGAACAAGACAGTGCCATCCCTGAGCAAAACGAAACAGAATTAGTCGGACCTAGCTTTAAACTGGGCCCCGAAGGTGGAACCGTACGACCTAGGATGACCAAAAGTATGGAGAGAGAGAGAACGTTTTTGGCACTTAGTCTAGGAGATCCCAGTCAAATGTGGGACCTCAGTGTTGATAAAGACTCTGAGAGCATAACTGCTGGTACCGAAGGCCATAGTTCTTTTTCAGAACAGGCTTGGGATTTCTATCAGGAGAAATACAACTCGGAACCGTATTCTGAAGCTCCGGATTCAGATGCAGCGAGACGGCTGTTGGAATTTGGGGATGACTATAGAGCATTCTTGGATTCCCAGTCTGATTGTTGTTCTAGTCTGTCCGCACATCCCGACGACACGAGTCCGACAACGAGACGACGACGACCGCCATCCGACACACGGGAAAGAAGCCTGCCGAGATATAAACGTCCAACTAGAACATCCCCCGTAGAAACACCATCACGCAAACCCAAGAAGTCCATGTCCAGCGCTGAACGGAAGAAAACACTATTGGACAGTCTGGAAAGATCCAGAAATAACACTAGCATTGAAAGCAACGAAGGGGTGCGACGAAGAAAACAGTCTGAGAACGAACGTAAGAACAGCAAACGCTCGCCCGACTTCGACCTCCTCAACGTCCAAGCTCTAAGCCGCAGACGCCACAGCTCAAACTTAACTAGCGACGAAATTAATAGTTCCCTGGAGTACACTGAGGTGAACAATCGACCGGATATCTTGGACTCGTTGAATAGACGTCGCAAGGAAAAAGAAGGAGGCACAGAGTTACAAAAGATCGCTCTGTCTGAACTACGACGACGGTCGACCGGCAGCGCTGATTCCGAGGGCGAATCCTCTTCACCCAAACACAGCCGAAAAAATTGGGGTGATTCTGATTCAGAGGCCGATGAGGTCAAATCTCTCGTCCGACGTTCGAGCACGCAGTTGGAGGTGACGGAGGCTTTACTGGCGCGACACGATTCATCGCCGGATATCCTGCGGGCCTTCGACTATACGGAGGTGGTAACTCGTTGCCGCGACAACATCAACCTTTTGGAGGTAGCTCTGTCTGAGGCGTCATTATCCCCAACCTTACAGAAAGAAATAAGAGCGGTGTCGGCAAGATGGTCGGCTCTAAGGGCTGCTGCGATACGTCGCGGAGGCGCTCGCCGCTTACGTCGCGAGATTGGAGCCCTAAAAGAGACTTTGGACGACATATGCGAGCCGGGAGACTATGCCCCCCAACCGCATACACGCGCACAACTACATAAGAGAATTGAGGAATTGAAGGAACGTCTATCGCGTCTGCTAGAATGTAAAGTCTCCATGTTGAAGCTGACAGTGTCAGTCAGGAGAGCTCTTGGTGAGATGGAGACAGATGATAGCGGATTGACGACTGAATTGACTTCCTTGATAGCTGCATGGGATGACGCCCATCAACGAACTTCAACGGAGTTACTGTCATTGGAGAAGGCGGTGTCAGCTTGGGCTGAATGGGAGCGTGCGCTGCGCGAACTGCAGGCAGCACTGCGTGGAGACCTGGCAGCTTTAGAAGCGCTGCGTGATCGACCCGATTGTGATGAACTCGCCTCCCACGTTAGACATCTAGCCGCTGCACTGTTTGATAAGAAAAAGGGCGGTTCCACGTGTGACTCTCTATCGGACTCGGGTATATCTGATGGCGACAGTGAAGGTGCCGGACGTGCGCGTCGTCTGACCGCTCTGAGGGAATTGGCGCGACGTCTACAAGCAGTGCTCGCTCCAAACTCACCCGCACATAGAGCTATAGCTAAGCGCATGGAACAAACAGAAAATGAAGTTAAAATTTTGCAAGAATCCTGCCGCGCCTTGGTCGAACAGAGTATACCAGATCTAAAAATTGACGAAGTGACGCGTGATCACACCATCGCTGTTTCAAGTAAGAAGACGGGTGCTGGCGATCCCGATTACAATCCACGCAGCGGCTGGGTGTGGCGTGTACTTCGTTCTTCTATCCCTATACAATTGTGTCTGGTTGCATTACTCCTAGCTGCGTGGCTGGTCGAGCGACCGCGGTGCTGCGATGCCTTGAATTCGCTCGCTCAAACCCTAACGCCACAGTTACGTTACGTCCGTGGCCCGCCCCCAGTGTGA

Protein sequence:

>DPOGS212022-PA
MGRPDGNEPITLQSNADKDVLQRDVECHRRIVSSVVRLCGGADVARALERRWHLLYLRAIEWLCHLEACIAKSDNQNCASIEVASDSDDEPALKQPRLTRRGSPRRQRTPINVIRRSSSEDSRQSASEEEQEYAVTYTWRGFGSDCETELVRDRDCEMDERRVPTVDEGEDRQTDQTVPGGDNVDGISVRETGRLEDIPREMNTPPTVVKRKKPVNTSKFNQSDRKSKNCATFYFRHHDTDSDRQVVETDEKSQEESSEEEWTYVDGPKIANEDSTDLMTSSIEIQCDLPVDDKKEKLLSPKKLESTTPDLIRVDQNSRCKDLERLVLQAEELVQKQAQQQMAKKKNTRNLKPLSFDEEGKSSREKMSRIKEWLNQSPEDKNDSNQCTESYDASGEYTTESEVDTSLTSEERNIHSSMDMSTSTCTVTPTHHAKVTLRKKRNATRPWSVSCLSQLSAGIIATPTDDVINMSISESALNTLASPRRVTPGNSSSKLRGSSTTVQGHISSTNTMTEACTSCVEANDKQCWLRRKRLKLKRQNNTRRERLVKSLSFCGRLSPEIEDKNERSAASDPATSRKNRFDTTSTSDNDSDADLIKQQLATIANLRKSIEKTHISSKEQDSAIPEQNETELVGPSFKLGPEGGTVRPRMTKSMERERTFLALSLGDPSQMWDLSVDKDSESITAGTEGHSSFSEQAWDFYQEKYNSEPYSEAPDSDAARRLLEFGDDYRAFLDSQSDCCSSLSAHPDDTSPTTRRRRPPSDTRERSLPRYKRPTRTSPVETPSRKPKKSMSSAERKKTLLDSLERSRNNTSIESNEGVRRRKQSENERKNSKRSPDFDLLNVQALSRRRHSSNLTSDEINSSLEYTEVNNRPDILDSLNRRRKEKEGGTELQKIALSELRRRSTGSADSEGESSSPKHSRKNWGDSDSEADEVKSLVRRSSTQLEVTEALLARHDSSPDILRAFDYTEVVTRCRDNINLLEVALSEASLSPTLQKEIRAVSARWSALRAAAIRRGGARRLRREIGALKETLDDICEPGDYAPQPHTRAQLHKRIEELKERLSRLLECKVSMLKLTVSVRRALGEMETDDSGLTTELTSLIAAWDDAHQRTSTELLSLEKAVSAWAEWERALRELQAALRGDLAALEALRDRPDCDELASHVRHLAAALFDKKKGGSTCDSLSDSGISDGDSEGAGRARRLTALRELARRLQAVLAPNSPAHRAIAKRMEQTENEVKILQESCRALVEQSIPDLKIDEVTRDHTIAVSSKKTGAGDPDYNPRSGWVWRVLRSSIPIQLCLVALLLAAWLVERPRCCDALNSLAQTLTPQLRYVRGPPPV-