Monarch geneset OGS2.0

DPOGS202744
TranscriptDPOGS202744-TA1347 bp
ProteinDPOGS202744-PA448 aa
Genomic positionDPSCF300464 - 28912-39778
RNAseq coverage70x (Rank: top 66%)
Annotation
HeliconiusHMEL0115254e-7647.72% 
BombyxBGIBMGA001746-TA9e-6638.08% 
DrosophilaCG7381-PA5e-2029.41% 
EBI UniRef50UniRef50_B0W5X74e-2523.89%Putative uncharacterized protein n=3 Tax=Culicinae RepID=B0W5X7_CULQU
NCBI RefSeqXP_001844111.18e-2623.89%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700324841e-2423.89%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|3072023021e-3426.60%hypothetical protein EAI_14262 [Harpegnathos saltator]
Group
KEGG pathway 
InterPro domain[365-414] IPR0061498.5e-06EB domain
Orthology groupMCL17445 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202744-TA
ATGTGTGAAGCAGGTGTGACGATAAGCATGGACAAACAAACGATCCTGTTATTTGCCCTGCTCCTGAACGCATCGTTGGTGTGTTCACAATGGGTGTGCAGGCAGGACAGCGACTGTTCGAGCTTGTCTGGGAGTTCATGTATATCAGGTCAATGCTCGTGTGAGCCGGGACAACAGGCGGTGTTGGGAGGCAGTATGTGTATAGATGTGGCTCCCTACTTCACGTCAACGTGTATTGAAGATTTTCAGTGTGGTGGGTTATTCACTGGCTTCGAGTGTCGAAAGAATGGAAACTCTACAGTGGGCCAATGCATGTGCCAACCTGGCTTCCACTACCTCAATGGTAGATGTTGGAGGTCAAAGGAACATGGACAGGCCTGCAGCACTAGTGACGAGTGTATGAACCCAGCGAGAGATCCCTTCAGTCTCCAGTGTAACGTGACCTGTCAGTGCGCTGAAGGATACTCCGAAAGACAGAGAGGCATTTGCAGGAAGACAGCCAGTGCTGTAGGCGAGGCTTGTGTTCTGAACAGCGACTGTAACTACCCTGGCGGTGTCTGCAATCCGCGTACGTTCGTTTGCAGTCAGGAAAACGACACATCAACGGACAGTCCGTTCCGTAACGTTGAAATACAAACACATCAGAAAATCTCCAGAGTCCGCTGTGGCGGAGGCCAGCCCTGCGCGAGCCCGTACGTGTGCTCCGGGCTGGGCATATGCGTGTGTCCCGCGGGCTACTACGAGAGCTCCCGGGGAGGCTGCTTGGCCGAGCTCGGGTCGCCCTCCACTCCGGAACAATGCGTGGGCTTCCTGGCGGAGGTGGTGGGCGGGGTGTGCACGTGTCGGGACAACTTCTTCTTCGACGAGAACATGAGGGATTGTGTCAAAGCCATCGGCGGCGGGTGTGAGGGAGACGAGGACTGCGTCATCGAGAACACGTCGTGTGACGGCACGAGGACGTGCCAGTGCCGCGACGGGTTCACGGCGTTCGAGGACATCTGCTGGGAGACCTCCGCGGGCTTCAATTCTTCTTGCAGCGTGACAGCGGAATGCGCTCTGCAATCAGCTGTTTGCACAAACGGGCGCTGTGCCTGCCTCCCCAACCACCATTACAAGGACGGCGACTGTTACCCGATGATTGCGCTCTTCTCTCTGTGCACGCGATCCAGCGAGTGTTTCCTGGGAGACGACATCTCTGATAGAGTGGTCTGCAGGAACGGAGTGTGCCAATGCGACTTTGACTATCCTTATTCAGAGGAATTAAGGATTTGCACGTCTTCATCCACAACTCTGATGGCAACCTCCTTTGTGATCGTAGCTGCAATACTATCAATACTAGTTCAATAA

Protein sequence:

>DPOGS202744-PA
MCEAGVTISMDKQTILLFALLLNASLVCSQWVCRQDSDCSSLSGSSCISGQCSCEPGQQAVLGGSMCIDVAPYFTSTCIEDFQCGGLFTGFECRKNGNSTVGQCMCQPGFHYLNGRCWRSKEHGQACSTSDECMNPARDPFSLQCNVTCQCAEGYSERQRGICRKTASAVGEACVLNSDCNYPGGVCNPRTFVCSQENDTSTDSPFRNVEIQTHQKISRVRCGGGQPCASPYVCSGLGICVCPAGYYESSRGGCLAELGSPSTPEQCVGFLAEVVGGVCTCRDNFFFDENMRDCVKAIGGGCEGDEDCVIENTSCDGTRTCQCRDGFTAFEDICWETSAGFNSSCSVTAECALQSAVCTNGRCACLPNHHYKDGDCYPMIALFSLCTRSSECFLGDDISDRVVCRNGVCQCDFDYPYSEELRICTSSSTTLMATSFVIVAAILSILVQ-