Monarch geneset OGS2.0

DPOGS212289
TranscriptDPOGS212289-TA1719 bp
ProteinDPOGS212289-PA572 aa
Genomic positionDPSCF300077 + 665898-670585
RNAseq coverage287x (Rank: top 38%)
Annotation
Heliconius% 
BombyxBGIBMGA011449-TA5e-3932.84% 
DrosophilaCG42678-PI2e-1440.82% 
EBI UniRef50UniRef50_E9I9I02e-2251.80%Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9I9I0_SOLIN
NCBI RefSeqXP_001604196.17e-2052.86%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3228024006e-2251.80%hypothetical protein SINV_06702 [Solenopsis invicta]
NCBI nr blastxgi|3071701094e-2824.88%Receptor expression-enhancing protein 1 [Camponotus floridanus]
Group
KEGG pathway 
Orthology groupMCL30845 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212289-TA
ATGCTACACAAACATCACGATCAAGGAGGCGGTGGTTTAGTACAACAGCTTCGTAAGAGCTACAGTCTGTCGGACCTGAGCGAGTGCGAGCCGCGAGAGGAACGAGCGCCCGATGAAGCGGACGACGTGCTCGCCGAGCCAAGGCTCATACGACGAGCTATTAAGAGTGGTTTCGCGACTCGTCGCAGCGCGTCAGAATCAAACAGTCGTACTCCGATATATTTCCCCGAAGTAGACGTCGACGTGCGTGGACCCAGGCGTGTTGATGAACCTGATTTCAGTCACATAAAATCCAGCGAGGACATTAGTTCCGGCTACTCATCAGCGGAGAACTCGTCTTGTTTAAGTCGCACGTCGTCCGCTGGGGCCCGTTCTCGTCGCGTCACACGGACTACGATCACGTCCATCAAGAGACCGCAGGCCGCTGAGGACAATGAAGTCATCGAGGAGCTTCCCTACGACGATATATGTGACAATTCCAATAATCACCGTATCTCTTCCCCCCTTTATTTATCCCACACAAATCCTCCTTTCATATACCAGTACGCAGGAGATCAGGTTAATATTATACAAGTATTAGGTAATTATCCACTGAGCAATGAATTTGATAAAAAAATTAATAATCTAGATTCACGGGACCAAAAAGAAACTGTTGACAAAACGTCACAAAAATTAGACGCCCAATTTAAGAGTGATAATGAAAATGTTATAAAAGATTTAAACGTGACAAACATTCCTGTACAAGAAGAAATCAAAGACAATAAAAATAATATGATTAGCCAAAATAATGAGAGTGTAAATATTGTTAATACTGAAAACGTTAATGCAGAAAATCACTTCATTGAAAATACGGGCGAGTTTAAAACTATGGACGAAATTACTGATCTTAATGATGAGGGTGAACCTGAAATTAAAATTCATAACACACTTGAAAGTGACGAAAGTTCGTTTGAAACCCCGACTTCTGTTGAGAGTGACGAAGGATCGTTTGGAACGCCGGCATCCACGCCTAAAACTGTCCGTAAAATGTCAAAAGGCAAATATGGCAAAGACAAAGCACCTATGCCACCTAGCAATAAAGTTGTAGACGATAAACAAGAAATCCAAGAAGATTCTGAAAATTTATCAATTCGTACTGAAGTTATTGAATCTATTGAAACCAATGAAAAGAAAAGTGATGACGTCGTAGTTATGCCTATAATAGAAGACATTGAAAAAGCGACTACAGATTTTAATGTAAATGAATCAGAAATAGCAATTGATGCTGGTGATTTAAAGTTGGAAGTTAATAATGCTTCAAATTCTTCGTTATATTTAGACGACAAACATGACAAGAAAAGACATAAATCCAAATCGCCCGGTCCAAAAGCAATGTCAACTCTCGGTAAAATGCTACAACTACCAAACAAATTAGTATTTTGGCATAAAGCAACAGGTAATGTATCCGACGCATCGGATTCAAGCAGGAAATCTTCTATTGAGAGTTTAAAAGACGAGTCTCGAGGTTGCAGTTACATAAATACCGTTCATAAAAGCGATTATGAAAACGACAAAAATAAATCAGACGTAAAATTAAGCACAGATAACATATCGCAAGAAATTTCAGAGAAAAGTGATGAATTACAAAAAGTTATAGAAGCGAAACTTGAAAGTAATCCTGAATATAAATTCATACCTTTATGCGAGGACATACAGATATCCAAAAGCACTGATGTATAG

Protein sequence:

>DPOGS212289-PA
MLHKHHDQGGGGLVQQLRKSYSLSDLSECEPREERAPDEADDVLAEPRLIRRAIKSGFATRRSASESNSRTPIYFPEVDVDVRGPRRVDEPDFSHIKSSEDISSGYSSAENSSCLSRTSSAGARSRRVTRTTITSIKRPQAAEDNEVIEELPYDDICDNSNNHRISSPLYLSHTNPPFIYQYAGDQVNIIQVLGNYPLSNEFDKKINNLDSRDQKETVDKTSQKLDAQFKSDNENVIKDLNVTNIPVQEEIKDNKNNMISQNNESVNIVNTENVNAENHFIENTGEFKTMDEITDLNDEGEPEIKIHNTLESDESSFETPTSVESDEGSFGTPASTPKTVRKMSKGKYGKDKAPMPPSNKVVDDKQEIQEDSENLSIRTEVIESIETNEKKSDDVVVMPIIEDIEKATTDFNVNESEIAIDAGDLKLEVNNASNSSLYLDDKHDKKRHKSKSPGPKAMSTLGKMLQLPNKLVFWHKATGNVSDASDSSRKSSIESLKDESRGCSYINTVHKSDYENDKNKSDVKLSTDNISQEISEKSDELQKVIEAKLESNPEYKFIPLCEDIQISKSTDV-