Monarch geneset OGS2.0

DPOGS204633
TranscriptDPOGS204633-TA2460 bp
ProteinDPOGS204633-PA819 aa
Genomic positionDPSCF300277 + 89915-92374
RNAseq coverage1211x (Rank: top 10%)
Annotation
HeliconiusHMEL0105700.085.22% 
BombyxBGIBMGA009463-TA0.081.18% 
DrosophilaHcf-PC9e-17467.71% 
EBI UniRef50UniRef50_E2BBL60.072.55%Host cell factor n=4 Tax=Formicidae RepID=E2BBL6_HARSA
NCBI RefSeqXP_318042.40.044.63%AGAP004774-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3072102430.072.55%Host cell factor [Harpegnathos saltator]
NCBI nr blastxgi|1571200920.046.08%host cell factor C1 [Aedes aegypti]
Group
Gene OntologyGO:00055151.6e-31protein binding
KEGG pathwaymcc:6981403e-177 
 K01787 (RENBP)maps-> Amino sugar and nucleotide sugar metabolism
InterPro domain[123-336] IPR0159151.6e-31Kelch-type beta propeller
Orthology groupMCL17742 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204633-TA
ATGAAGGAAAACGCTGTACTTAAGTGGCAAAAAGTTTACAATCCTAGCGGCCCTCAACCACGACCTCGCCATGGCCATCGTGCAGTTGCCATTAAAGATTTGATGATAGTTTTCGGCGGCGGAAATGAAGGGATAGTTCATGAGCTTCACGTCTTCAATACCACCACGAATCAATGGTTTGTTCCTGTTACGAAAGGAGAGGTGCCTCCAGGATGCGCCGCCTACGGCTTCGTGGTCGACGGAACACGTCTCCTGGTTTTTGGCGGCATGGTAGAGTACGGCAAGTATTCCAACGATCTGTACGAATTACAGGCCTCCCGGTGGGAATGGAAACGTCTAAAGCCTCTGCCACCCAAGCAAGGCCTACCACCATGCCCAAGGTTAGGTCATAGCTTTACTCTCTTAAATGGTAAGGTTTACCTGTTCGGTGGACTGGCCAATGAGAGCGACGATCCTAAGAACAACATACCAAGATATCTCAATGACCTGTATACATTGGAGCTGTATCCGAATTCATCTATGACGGTATGGGACATACCCATCACTTATGGTCAGTCACCACCTCCTCGCGAGTCACATAGTGGTGTGAGTTACACAGATAAGAATACTGGAAAATCCTCGTTGATCATATATGGAGGTATGAGCGGCTCACGTCTCGGTGACCTGTGGGTGCTTGATGTGGACAGCATGACTTGGTCTCGGCCAGACCTGGGTGGCCCTCCACCACTACCCCGCTCTCTTCACACAGCAACTGTCATTGGACATCACATGTATGTGTATGGCGGTTGGGTGCCATTGGTTCCAGATGAATCAAAACTTGCAACACATGAAAAGGAATGGAAGTGCACCAACACACTTGCTTCTTTAAATTTGGATACCATGACTTGGGATTGCATTGCCCTTGATAAGTTTGAAGAGTGTGTACCCAGAGCTCGAGCCGGGCACAGTGCGGTTGCTATTCAAACAAGACTCTACATCTGGTCAGGCCGGGATGGCTATAGAAAAACTTGGAACAATCAGATTTGCTGCAAAGATCTGTGGTATCTTGAGGTTGGTGTGCCACCACAAGCGGGTCGTGTAGCACTTGTAAGAGCAGGCACAACTTCACTTGAACTCTGCTGGCCCGCTATGAACACAGTCACTACATATCTTCTACAAGTTCAAAAATGTGGTAAAGTGACTCCCACAAGGTTCCCGACAGCTACGGAACCGGTGGCAGCTCCTCCTCCTGCCTCACCTGTCTCTGGACAACCGGATGCTGCCAAAGCGTTTGGACTCACATCACCTATTGGGACTGGACTTCCACCAACCCCAATTGACCTACCAATAAGACCTGCGGCAGCCGCGGCGTCTCCAGCTGCCAATCCCATTGTATCAACTCCACAGAAAGTCGTATCCAGTGCTATTAAAATGCCTGGTCAAGCTGCCGTCAAAATATCTCCTAACACTCCGAAACAAACCTACCACGGCAAAACAGTTGTAAAATCACCCGCTGCAGGCTCTTCACAGCAGATAAAGGTCGCTGCGGTCACGCCACAAGGAGTGACTCGTATTGTAAGCGGAGTAGCGACTCCTAACACAGTTCGTGTGTCCACTCCGCAATCAAACGCTCAGATAGTGCTCGGCGCGGCGGCGGGCTCGGCTGGCTCGCCGAGGTTCGTTCAAGTTAAAACTGGCGGTAACACTGGTGTCGTCAAATTAGGAAGCAGTAATGTGCCTGTGAAACTCGCGGGCAATGCCGTTCCATTAAAAATTGGTGCAGCCAATCTCCAAGGCAAAATCACAGCAAACAGTGTACAGAAGGTCGGCACGACTCCGTTGAAGCTAGGTGCTGGAAATGTAAAAATAAGTACGAGCAATGTGTTAGTGAAGACAGCTGGAGGAGTTCCCATACAAGCCGGAGTCGGCAGTGTGCAAGTCAAGGCTGGAGTGCCGGTGAAGCTGAGCGCTGGTAATCTGCCGGTAATCGGAAGCTACGGTGGCGCGATACAAATCGCAAGCGGCGCCATCGGCCAGCAAGTGAAAACTCCCGTTTATAAAATAGTGACGGCTAAATCTAGCGATCAAGGAACGGCTGTCACGAACGCCGTGTCAGGATCTCCGGTCCTGAGGCAGGCCGGAGGTAACGTCATCATTAAGAAGACGCCGGGTGCTAGTTCACAATCAAGCACGTCGCCCCAGTACGTGACACTGGTGAAGACCTCTACAGGTATGACCGTGGCGACTGTACCCAAGATGGCTGTGATGCAGAACCGACCGGCGACTCCGGCCAGTGCTGCACAGGGGATTACTCCTGGGGCGACGATCGTCAAGCTCGTATCGGCTAACTCAGTAGGCGGCAACAAGATCATAACACTACCACCCAATAAGCTGCAGCTCGGCAAGACAGGTGTTGGAGGCAAGCAGACCATAGTTATCACCAAGTCAGCGAGTCAGTCACAACAGGGGCAACCGCAGTGA

Protein sequence:

>DPOGS204633-PA
MKENAVLKWQKVYNPSGPQPRPRHGHRAVAIKDLMIVFGGGNEGIVHELHVFNTTTNQWFVPVTKGEVPPGCAAYGFVVDGTRLLVFGGMVEYGKYSNDLYELQASRWEWKRLKPLPPKQGLPPCPRLGHSFTLLNGKVYLFGGLANESDDPKNNIPRYLNDLYTLELYPNSSMTVWDIPITYGQSPPPRESHSGVSYTDKNTGKSSLIIYGGMSGSRLGDLWVLDVDSMTWSRPDLGGPPPLPRSLHTATVIGHHMYVYGGWVPLVPDESKLATHEKEWKCTNTLASLNLDTMTWDCIALDKFEECVPRARAGHSAVAIQTRLYIWSGRDGYRKTWNNQICCKDLWYLEVGVPPQAGRVALVRAGTTSLELCWPAMNTVTTYLLQVQKCGKVTPTRFPTATEPVAAPPPASPVSGQPDAAKAFGLTSPIGTGLPPTPIDLPIRPAAAAASPAANPIVSTPQKVVSSAIKMPGQAAVKISPNTPKQTYHGKTVVKSPAAGSSQQIKVAAVTPQGVTRIVSGVATPNTVRVSTPQSNAQIVLGAAAGSAGSPRFVQVKTGGNTGVVKLGSSNVPVKLAGNAVPLKIGAANLQGKITANSVQKVGTTPLKLGAGNVKISTSNVLVKTAGGVPIQAGVGSVQVKAGVPVKLSAGNLPVIGSYGGAIQIASGAIGQQVKTPVYKIVTAKSSDQGTAVTNAVSGSPVLRQAGGNVIIKKTPGASSQSSTSPQYVTLVKTSTGMTVATVPKMAVMQNRPATPASAAQGITPGATIVKLVSANSVGGNKIITLPPNKLQLGKTGVGGKQTIVITKSASQSQQGQPQ-