Monarch geneset OGS2.0

DPOGS210636
TranscriptDPOGS210636-TA2280 bp
ProteinDPOGS210636-PA759 aa
Genomic positionDPSCF300168 + 674198-677432
RNAseq coverage59x (Rank: top 68%)
Annotation
HeliconiusHMEL0129174e-16046.70% 
BombyxBGIBMGA013588-TA1e-9136.61% 
DrosophilaCG42399-PB1e-0825.12% 
EBI UniRef50UniRef50_B0WF531e-2830.77%Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WF53_CULQU
NCBI RefSeqXP_001648160.13e-3134.45%hypothetical protein AaeL_AAEL014174 [Aedes aegypti]
NCBI nr blastpgi|1571038616e-3034.45%hypothetical protein AaeL_AAEL014174 [Aedes aegypti]
NCBI nr blastxgi|1700390058e-2930.77%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00054881e-10binding
KEGG pathway 
InterPro domain[526-717] IPR0160241e-10Armadillo-type fold
[640-715] IPR0119892.7e-06Armadillo-like helical
Orthology groupMCL26097 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210636-TA
ATGGAAGATGAAATCGAAGATCTAGTGGGAGACGTCAGATACACAGACTCTCAGAAGAGTGTAACCACATCTAACGTATATTCATTCGATTTCACCGACAGCAGCCCGGATGGAAACAACTCCACGTCGAAGGCTACTTACGTCATTGACTGTGAAGACAGTGATGTCAGTGAGGCGATGTTTCAAACGCAAGAAAACTTGACACCTCACAAAGTAACCGATACATGTTTCATTGTAAATCGAAATTCTCCAGACAACATAATATATGCGAAAGACCAGTCCATGTGCACAAATTCTTCAAAAATATCCATACTGGCTCAAGAGACATACACGCAAACAAGTAAAACCATAATTGAATTGGTTCACGGCAATCCACATAAAATTGAATGCAGTACATTAGAAACACAAACATCATTTGTATCGATAACCGAAAACAGGACTTTAGAATACAGAATCCATCTCGTTAATGCTAATGGAAATGACGAGAGAGATGTCGTTAATAATCTCAACGATACAAACATCGAAAGCCACCCCATCCTAGCGAATAGCTTAGAAGACAAATCGTCAATTTACAAAGAATCGGAGACGGAAAATGATGTAACAAGACAAAGTTCAAATGATACAATAATTCGATTTATAGATCAGGATTCCAAAGAGGATTTGTCGGAAAGTAAAAACGAAACGCCGTACGATAGTACCAGTGTCTACCACTCGTCATTCGACTCGGATATGGAAGAGGATTCTCTGATGGAGAACGATTACCATAAGAAATGTAATAAAGACGATACCTTTGATGAGGGGGAATTTGAGTTTGTGGACCACGATGTTAAGGAACTGTACAACAAAATTTCAGAAAGCCCTCTATTGCTATTACAAACGCATTCTCCAGAACATTGCAACCGTAACTTTTCCCCGCTCACACCATTAACCGAAGAAACTTCACATAGAAAAGACAATATCGTAGATGTCACACCGAGTGAAAAGACATTGACGGGAAATGAGGATAGCAATGACACCATATTTGTTAATAGTTGCGGTATTCGAGTGAAAATGTTGAACGATAAAAATGGAGGTTCAGCTGCAGATGACATACTGAAATTACCTCCGATTCAAAAGAGTCCAAGATGCCCAAATAGTACGCATTTTAATTTATTATTTTCCTTGAACACGCCTCAAATGACTACAAAAAGGGAAGAACCTATACCGAATCTATACAAAAGTCAAAATTATAAAATTGAAGACCGTTGGGAAATGACGCGGAGGGATTTGGCGTCTGGAGAAAGTGCTTTCATAAGTGGCAATAGTGATCCTCCAAAACCGATTAACAATTCTAACCAACTGCCTCCAATTCATCTGGAAGGCGTTTTCCCTACTTTTGAAAATAGAATGGACCTAGCTACGAAATTCGAAGAATTTTCAAAAGATTACGATGCTATCAGCATTAAAGGAAAAATAGGGGAGTTAAAGATGTTCTCAAACGATATCAGGCAAAGATCTGGCAACAGAAGTATTTCACCTACCAGCACCTCATCGCCTGGAGATAGCAAACTGAGTGATGTAGCTGAAAAGGGCTGTGATGCACTTTGTTCAGAATTATTAAAGAGACTGCGTTCTTCGTCCTGGCATGAGGTGATTGAAACTTTAGAAAATTTACCCAAAGCATTGGACTCATTTTGGTGTGTTATCACAGAAAACCGGATCGCGACATTAATACGACAGGTCATTGTACACATTGAGTCGCCACGTACACAGGTTGCTAAAGCAGCCTGCAGCTCGCTGGCTGATATTCTAAGGAACACTAATTACACAAAGAAACCCGATTTCTACGAAGCTATAGCCATTTTACTTATTAAAACAGGCAGCTTCAGTCGGCCGGTGAGACGCGCCGCTAACGTGGCGCTAGACGACATAGTCTGTAGTGTCGACATCACACAGTCTGTTACCGCTATATGCGTATATGGAGTTGGACATAAAAGTCCATTTGTACGCTGTGCCTCCGCTCGTCTGCTGGTGGTATGTTGTGCTCTGGCCGGTGGAGGGAGACGCGTCCTCCGGGAGCGACCGCCAACCGCCGCCGCAGCCCGTCGTCACGCACTGAGAGCGCTGGCGGAACTGCTCTATGATAAAAATACCGATACCAGGAAATATGCTGAAAGGCTTTACCTCATGCTTCGACCGTTACCAAACTTCGAAGCCTACTTCCTCACAGACGTGGACGTGGAACTTGCCTCGCAGCACATGAAGAAGTTTGACCGTCTCCTCAACAAGCAACCCAGATGA

Protein sequence:

>DPOGS210636-PA
MEDEIEDLVGDVRYTDSQKSVTTSNVYSFDFTDSSPDGNNSTSKATYVIDCEDSDVSEAMFQTQENLTPHKVTDTCFIVNRNSPDNIIYAKDQSMCTNSSKISILAQETYTQTSKTIIELVHGNPHKIECSTLETQTSFVSITENRTLEYRIHLVNANGNDERDVVNNLNDTNIESHPILANSLEDKSSIYKESETENDVTRQSSNDTIIRFIDQDSKEDLSESKNETPYDSTSVYHSSFDSDMEEDSLMENDYHKKCNKDDTFDEGEFEFVDHDVKELYNKISESPLLLLQTHSPEHCNRNFSPLTPLTEETSHRKDNIVDVTPSEKTLTGNEDSNDTIFVNSCGIRVKMLNDKNGGSAADDILKLPPIQKSPRCPNSTHFNLLFSLNTPQMTTKREEPIPNLYKSQNYKIEDRWEMTRRDLASGESAFISGNSDPPKPINNSNQLPPIHLEGVFPTFENRMDLATKFEEFSKDYDAISIKGKIGELKMFSNDIRQRSGNRSISPTSTSSPGDSKLSDVAEKGCDALCSELLKRLRSSSWHEVIETLENLPKALDSFWCVITENRIATLIRQVIVHIESPRTQVAKAACSSLADILRNTNYTKKPDFYEAIAILLIKTGSFSRPVRRAANVALDDIVCSVDITQSVTAICVYGVGHKSPFVRCASARLLVVCCALAGGGRRVLRERPPTAAAARRHALRALAELLYDKNTDTRKYAERLYLMLRPLPNFEAYFLTDVDVELASQHMKKFDRLLNKQPR-