Monarch geneset OGS2.0

DPOGS207859
TranscriptDPOGS207859-TA3888 bp
ProteinDPOGS207859-PA1295 aa
Genomic positionDPSCF300042 + 1699313-1710028
RNAseq coverage46x (Rank: top 71%)
Annotation
HeliconiusHMEL0084230.068.66% 
BombyxBGIBMGA009942-TA0.048.69% 
Drosophiladila-PB9e-5426.12% 
EBI UniRef50UniRef50_UPI00022AFE391e-5430.09%UPI00022AFE39 related cluster n=1 Tax=unknown RepID=UPI00022AFE39
NCBI RefSeqXP_001651110.19e-5325.45%autotransporter adhesin precursor, putative [Aedes aegypti]
NCBI nr blastpgi|3485224434e-5430.09%PREDICTED: 5-azacytidine-induced protein 1-like [Oreochromis niloticus]
NCBI nr blastxgi|1571104561e-7324.73%autotransporter adhesin precursor, putative [Aedes aegypti]
Group
KEGG pathway 
Orthology groupMCL25671 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207859-TA
ATGTCTAAAGAAAATAATAATTTAAGGCTTCTCGGCTCACCAGTAAATTTAACTTATAGAAACAAGAGAAAAGAAGATAGGAAAAATTCAAGAAATCGTCCAAGATCTGCTCTACAAAACTCTAGTTCTCCGGATGTTCCAGAAAGATACAAGAGACCATTCTCAGCGGACACCAAGGAACGAGGCCAATCCACAAGATCCTTTTATAAAACTTTTAGCGCTGATTTATTACAGTCTTACAATAACTCCCCGTTGAGTGTGAAAGTGCTCCCTCCCACAGAAGACCTTTTGACCCATTCAAATGTTCACATCACTAGAAACGAAAACAAAGAGATAAGCAGCAACGCATCTGATTACGGTTCCGAAGACACGTTTATTAGTTTGGGAACAAAGATAAAAGCAAAAGCTCAAACTGTCTGCAAGAATAGAAATACGAATCCAAAGAATTTCTTAAAATACAGGACTATTGCAAAGAAAGGACGGAAATCTATGGAGAATTTAAATGAGACGAATGACAATAATTACGGTTTAGAGATTACAATCAAGGAGAAATCTGGACCGCCGTCGCCTACGAGGAGTACCGATTTGTTCCCATTGAGGCCCTCGTCTCCGTGTCGGAATAAGTCGTACGAATCCTATTTTTTAGCCCTGGAAGACGTTAGGAACGGTGATGGTATTGTAGGCAGAGTATCGTTCGCACCGGCCGATAATAAAATTGACAAAAATCTTGACAATCCGCGCCGTTTGAGCCTGACGAGACAACAATTATCGTTAGTTGAAGAGGAATCAGCTCAAGACATTGATAGCATACCATCTCAGAATACTGAAAACCTTTCACCAAAAAACAAATATAACTGTAAGGAAATATCTAACAACGATATTAATTACGATAAGAACGACAAGGGAACTTTATTAGATAATAATAAAGATTTTAATAGTGCACAAAATTATAACGCATTTTACGATGACATCCACTCAAATACACCAGACTCCTTCAACACTAGATTATCTAGTACAGGCATTCATACAGATTCTTCAAAAGATTCAGGGTACCCGGACAGTGTCAATAAAGAACATAGAACGCTGACGCAAAACTATTTACTTACACCTGCGCCTGATTCATATGACTTTAATAGCAAAACAAATCCGACTAACAATTCGGAATATTCCACAGATAATGATAAAGCTTTTAAAACAAACTTCAGCAATAAATGGTCCGAACCATGGAATCACAGCAGATTACTCTACAAAGATTTCTTTTTAAAGAAAGAGACTCACGGCCCAGCCCCACAGAACACACCAACAAAAAATGACGTTCATATCCCGGGGACTAACGAAAACATAGAGGGAGACACAGAAAAGTTGGAATATCCTACTTACCTCCTAAACAGTTCTACAAAAGCTTACACCTCAAAAGTTATTGAAGATTACAGGAAAGAACTAGAGGCTATAAACACTTTACACGAATTAACAGTTAAAGATATAAAAATAGACCCCATATCCCCTACTCCTCTTAGTATAGACGAAATGTTTGAGCAACATAGTAATCGTTTTAACGATGGTAAAACAGACTTGAGCGAAAACTCGCAAGAAAGCACTAATAACAGTGACAGTACCGACAAAAGCTCTCTAAACAACAAGAGAGATATATCCAAAGTGCCAACGAGAGAGCTGATACAGAACTACTTCAAAGTCAAAAGTGATTGTACAAAGGAGTTTCCGAGAAATGCAAAAAAATATGACAAAAAACTCAACAATATAAATCCGAAATACGAAGAAAATTCCAGTTATAAACAGTATTGGAACAACAGGAATGCAAAGAATAGTGTAGAGAGAAGTAAAACTCCCGTTGATGTGAAGACCCTGAACAAGGCGGTGACAGGCAGAGCGCCGTCTAGTGCTAGAATCGAAAGTGTCCAAAACGACAAAGATATTGAATCATGGATGTCTTTATCAGCTCCGTCACCGAGAATGCTAGAGACTGATAACGTCAAAGATAACACCGCAGAACCTCCAAAAGCAGTACCAGTTATTGATAAAATCGAGGAAAATGACGAAGCCGAATTTAATAACCAAAGGACAAAAACATCAGAAGCCCCAAAGCCAAAGGAGCTCAACTCTAAATCTACTATCGTCGACATTTACTCGATGTTGAAGGAAATTGAAAGTTTTGGTGATAATCCTGTCACGAATATTGATGAACCGAGCGAACCGAAACCGAAACAGGAAGATAGATGTTCAACACCCAAAGATAACTTTATGGAGATCTTTGAGTTTTTGGAAAAGGTAGAACAAAGCGCGAACGATGCTCTATCAGTCGTCACCAACACAACACCGCAAACTATACCCAAACTTGAAGCTCTACTAAAGCTGCCACAAACGGAGTTAGCTCAGAGGCTTGTAACGCTATCATTACAACTAGAGGAACGATCCTGTTGCATTGCCTTACTTCAAGAAAGTCTCGCCAATCATAAGGAACAGATGATCAACAAAGTTAGCAACCTCGAGAAGCAATCACATCGGAACATAGCCAAGGTTAAACAAGAATGCGAAGAGACGATAAAGAGACATCAGAATTTTATTGATCAGCTAATAAACGACAAGAAGACACTGAACCATCGCATTGAGCAGTTGGTTGACGAACGTCGTACTCTTGAAGAGAGGTGGAAGAGATCCGCTCAGACATTGGAAGAACGATACAAACTTGAGCTGAGAAATCAACACGACAAGATGGCCGCCGCTCAGCAAGTCGCACGGCAGCGGTGGGTGCGTCAAAAAGCTGAGAAAATTAAGGAGCTTACAGTCAAAGGTCTGGAAGGAGAGTTACGAGAGATGGCAGAGAGACAACAAAAAGAGATATCGGACCTGAAAATGTTCCACGCGGAACAATGTGGGAGAATGAGCGCGAAACACGCAAATGACTTAGAGGAACTAAGGAGGAGTTTAGAGGAAGAAAAGGAGAATGCCCTGATAAAAGAGCGACAGTTGGCCAGTTCTAGACTAGAGCGTCAGTTACTTGAGTTGGAGCTGTCTCAGCAGGAACAGCGTGCTAGGCTGGCGGACGAGCTGAGGGCTGAGGGGGAGAGACTGGAGGGAGAGAGGGCAGCCAGGGAGAGAGAACATAGAGAACAGATGGCCAAATGGAAAGAGGAGCAAGAGAAAATGTTGGATGAGAGAAAACAACAGATAGAAAAGGACATAGCAAAGGAAAGAGAAATATTTGAGGAAGAATTAAAGAAGAAACGTCTAGAACTTGAAAAAGAATTCAATCAATACAAGAAAGAATATGAAGCGGACCAGCAGGTGCTGCTGATGAAGAAGGTGACAGAGATAGCGGCACAGCATAAGATAGAGAGAGATAGGGAGATAGAGAGAGCTATAGAGAGCATGGAGAGTGAGGCTCAGGCTGGCCGAAGAGAACTACAGGAGGCATTGAGGCGGAATAAAGAACAATACGAAATCGAATTGAAGGAATTGGCGGAAACGGAACAAGCGACGCTGAAGAGATACCAAGACGCACAAGCTCGGATTAGAAGCACAGAGGATAAATGCGCAGAATTAGAAGTGGCTATCAGTCAGACTGAGTCCCGGAATAGAATTTTAACAGAGAAAAACTCTCAACTAGAGTCCAGAGCGGATGAGGTGAGATTGAGTTGTGAGGAGTCTTGGAAAAACAAAGTGGAGAATTTACAGAAGGACATGGAGAACATGAAGAAAACACACGAGGAACAGATGCATCAGCTTTATGCTAAGGTGAAAGTCGCTGTGGCGAGGAAAGATTCAGCGATACAAGCACTAACAAGAGAAACAGCGAAATATCAAGAGAAAATCACAATATTAGAACAAAAACTACAACAGCAGAGAAAAGATTTTCTTAAATCGAAATAA

Protein sequence:

>DPOGS207859-PA
MSKENNNLRLLGSPVNLTYRNKRKEDRKNSRNRPRSALQNSSSPDVPERYKRPFSADTKERGQSTRSFYKTFSADLLQSYNNSPLSVKVLPPTEDLLTHSNVHITRNENKEISSNASDYGSEDTFISLGTKIKAKAQTVCKNRNTNPKNFLKYRTIAKKGRKSMENLNETNDNNYGLEITIKEKSGPPSPTRSTDLFPLRPSSPCRNKSYESYFLALEDVRNGDGIVGRVSFAPADNKIDKNLDNPRRLSLTRQQLSLVEEESAQDIDSIPSQNTENLSPKNKYNCKEISNNDINYDKNDKGTLLDNNKDFNSAQNYNAFYDDIHSNTPDSFNTRLSSTGIHTDSSKDSGYPDSVNKEHRTLTQNYLLTPAPDSYDFNSKTNPTNNSEYSTDNDKAFKTNFSNKWSEPWNHSRLLYKDFFLKKETHGPAPQNTPTKNDVHIPGTNENIEGDTEKLEYPTYLLNSSTKAYTSKVIEDYRKELEAINTLHELTVKDIKIDPISPTPLSIDEMFEQHSNRFNDGKTDLSENSQESTNNSDSTDKSSLNNKRDISKVPTRELIQNYFKVKSDCTKEFPRNAKKYDKKLNNINPKYEENSSYKQYWNNRNAKNSVERSKTPVDVKTLNKAVTGRAPSSARIESVQNDKDIESWMSLSAPSPRMLETDNVKDNTAEPPKAVPVIDKIEENDEAEFNNQRTKTSEAPKPKELNSKSTIVDIYSMLKEIESFGDNPVTNIDEPSEPKPKQEDRCSTPKDNFMEIFEFLEKVEQSANDALSVVTNTTPQTIPKLEALLKLPQTELAQRLVTLSLQLEERSCCIALLQESLANHKEQMINKVSNLEKQSHRNIAKVKQECEETIKRHQNFIDQLINDKKTLNHRIEQLVDERRTLEERWKRSAQTLEERYKLELRNQHDKMAAAQQVARQRWVRQKAEKIKELTVKGLEGELREMAERQQKEISDLKMFHAEQCGRMSAKHANDLEELRRSLEEEKENALIKERQLASSRLERQLLELELSQQEQRARLADELRAEGERLEGERAAREREHREQMAKWKEEQEKMLDERKQQIEKDIAKEREIFEEELKKKRLELEKEFNQYKKEYEADQQVLLMKKVTEIAAQHKIERDREIERAIESMESEAQAGRRELQEALRRNKEQYEIELKELAETEQATLKRYQDAQARIRSTEDKCAELEVAISQTESRNRILTEKNSQLESRADEVRLSCEESWKNKVENLQKDMENMKKTHEEQMHQLYAKVKVAVARKDSAIQALTRETAKYQEKITILEQKLQQQRKDFLKSK-