Monarch geneset OGS2.0

DPOGS210823
TranscriptDPOGS210823-TA2490 bp
ProteinDPOGS210823-PA829 aa
Genomic positionDPSCF300027 - 464308-467137
RNAseq coverage249x (Rank: top 42%)
Annotation
HeliconiusHMEL0127630.066.99% 
BombyxBGIBMGA007138-TA0.052.80% 
DrosophilaCG7044-PA4e-7525.71% 
EBI UniRef50UniRef50_B0WL503e-9330.18%Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WL50_CULQU
NCBI RefSeqXP_313823.43e-9529.15%AGAP004523-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1700435221e-9230.18%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700435225e-9629.71%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00054886e-12binding
KEGG pathway 
InterPro domain[313-715] IPR0160246e-12Armadillo-type fold
[439-607] IPR0119894.9e-10Armadillo-like helical
Orthology groupMCL15540 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210823-TA
ATGGATTCCAGTCTAAGAGTTGCTTATATGGAAGTAGCTCTGTCACTAACCAAACATAATTCCGGTATCTATTGGCTTTTAGAAACTGGAATTTGGAAAGAAATTCTACAGCTCTGTAATGAAAAACGGACAGTTTTTGTTGTCCGACAGACATACAAATTTGCTTCATTATTTTTGTGGAAATTGGTTGATATAAATGAGGAAGCTAGCATTAAAACCGTTTTAAATTTTATACTCAAACCTATGTCAGAAATTGATATGATCAATATAAATTCCATGTCAAGCGAGTATGAAGACGAGTTGTGTAAAGTATACGTACCGATGCTACAAATACTGTTGTCGGTGGTGGGCAATGCAGAGCGCATTAAAACACGTAATTCTGTGATAACGTCGATGATCAAAGATTTCAACATGTTAACATTCTGTTATTTGATAAAGACTAGAATAAGAAGGGAGGACGTACTTCTGCTTGTGACAAAATTATTATTTTGGCTTTCGATTGGTAAAACTTTTATTTTTAAACCGCTACAATTATCGGAGAGGTTCGAACGGGACGATTTCGTGGAGGTCACGATAACTTATTTCAACACAGTGAACTATCTCATGCAGCGTCGCTGTTGGGCTTTAGTGTTCGATTACTGTAACGCCTGTAATTTAATATTCAGCTCGGTCTGGAGCAACATGAGACCGGCGGTCTTCGAAGTAGACGGAAGGGAGGTGGAATTGCAGAAGCAGTTACTCGTCATATGTCTCATACCGTCCATGGTGTACATAGGTGCCGGGAAGACAATGGGAATCGACGGTGACGAAGTCGATAATTTTATTATTAAACTATTGCATTCAACTTGCGAGCACACTGCAAGGACATGTTATGCTCTCAGGGATCTGTTATTGCAGTTGGACATGGAGTCCGTGACCCTTCAGAGTGTGAAACGTCTTACTTGTTTGAAAGATCATTTAAATAATGACCAAGCGAACCTGCTATTCCAGGCACTATTCTACGTCCTTAAAGAATACGATCCTATAGACGAAAACGGGGTAGTGAAAGCGGATATAAATATTACAGATAGCGAAGAGAAAGTACTGATTATGACATACGTTTTGGACATACTGCTGTCGCTGGTTAAGAATTATAACATCAACTGGAAGGAGAGCCTTGAAGTCATTTGTCTTTATAGCGTTGTATTTAATATTTTGAAGATAAAGAATAACAATTTCTCTAGTAGGTTTGTAGTGATCGCATTAAATGTCATCACGATAACAGTGAAGAAGTTTCTACCGCCAACCCTATCTCTGTTAATGGAGTCCAAGCCTGGTTCCTCGATGGATGAACTCGGAGAACTAATTTATATGAAATTAAACGATTTCCAGTGGGAGGTCCGAGATTCCGCTCTGGAATTGCTATATGTGTGCACAGACATCTGCTTTATTAAGTTTCCGCCGTTCCAAAAACAAATTTTATCTAACAACCTCATCAATCTGGCGACAACCATGGCGCTGAATGACCACGAGTTCTATGTGCGTGTTTCTGCTCTGAGGTGTCTTGGAGCTGGTTGTAAAGTCGCCTCACTCTGGGATCACTTAAAAACTCAGTATCCCAACATACAGGAACTTCTAGTGGACATCATGAACACCAACCAAGAAGGCGTTGTACGTAAAGAGGCCTGCAACGTTTTATGCGAAATTTACCAAAGCGTCAAAATTAGCCCGAACTTCAAGTCCGTTCTATACGAGAACATGATGAACGCAGCGCTCTCTGATTTTCACTGGGAAGTTCAGCTGAGCGCACTTAAGTTCTGGAAAATAGTGATTCAATCCTTGCTCACCGCACAGGGCATGCTCGACGGCACATTTCCCCCGGTGACGTTTTCCAGACAGACAAGGAAAATTGTTACTCTAGACGCGAACGAAATCAAAAGGCGTTTGACGGCGACCCTTGAAGAACTGTCCTCAATCGGATGTTTAACTGTGTTAGTGAAACTCCTTCATGACGATACTGATGTCGAAATTATGGATTCTGCTAGGATTATTTCTACCGAACTTCTAGAGATACTTGATCAATACAATGTTCCTGAAACCTTGACACCAAGTAACAAAGAATCAAACACCATGGATGAGTTGCAGCAGCAGAACATTTCTGATGACAGTACTGGAAATGGTGACACTATGGACTCAGAACCCGCTACCTCATCGGAGAATGTGATAGAAAGTATATTGAATTCCGATGATATTAACTTACTTGCAAATATATATAAAAGACAAATGAACCTATCACCGGAACAGGAAACTAAAAACACAAGTCACACAAAAGTTGTAAGGTTAGCATCGCCATACTTATTTGTTAGATATACCAGGAGTAAAGACTTCAAACAAATCATAGAGGACAAAAGAAATTGGAAAGATGGAATCAAAAGTCTTTCGTCATTACTAGACGATGTTCTGGGCATATACGAATTCAATGAGGAGGTGAACTCACTAGACTGCTATTGA

Protein sequence:

>DPOGS210823-PA
MDSSLRVAYMEVALSLTKHNSGIYWLLETGIWKEILQLCNEKRTVFVVRQTYKFASLFLWKLVDINEEASIKTVLNFILKPMSEIDMININSMSSEYEDELCKVYVPMLQILLSVVGNAERIKTRNSVITSMIKDFNMLTFCYLIKTRIRREDVLLLVTKLLFWLSIGKTFIFKPLQLSERFERDDFVEVTITYFNTVNYLMQRRCWALVFDYCNACNLIFSSVWSNMRPAVFEVDGREVELQKQLLVICLIPSMVYIGAGKTMGIDGDEVDNFIIKLLHSTCEHTARTCYALRDLLLQLDMESVTLQSVKRLTCLKDHLNNDQANLLFQALFYVLKEYDPIDENGVVKADINITDSEEKVLIMTYVLDILLSLVKNYNINWKESLEVICLYSVVFNILKIKNNNFSSRFVVIALNVITITVKKFLPPTLSLLMESKPGSSMDELGELIYMKLNDFQWEVRDSALELLYVCTDICFIKFPPFQKQILSNNLINLATTMALNDHEFYVRVSALRCLGAGCKVASLWDHLKTQYPNIQELLVDIMNTNQEGVVRKEACNVLCEIYQSVKISPNFKSVLYENMMNAALSDFHWEVQLSALKFWKIVIQSLLTAQGMLDGTFPPVTFSRQTRKIVTLDANEIKRRLTATLEELSSIGCLTVLVKLLHDDTDVEIMDSARIISTELLEILDQYNVPETLTPSNKESNTMDELQQQNISDDSTGNGDTMDSEPATSSENVIESILNSDDINLLANIYKRQMNLSPEQETKNTSHTKVVRLASPYLFVRYTRSKDFKQIIEDKRNWKDGIKSLSSLLDDVLGIYEFNEEVNSLDCY-