Monarch geneset OGS2.0

DPOGS212109
TranscriptDPOGS212109-TA1614 bp
ProteinDPOGS212109-PA537 aa
Genomic positionDPSCF300038 - 406956-428112
RNAseq coverage100x (Rank: top 61%)
Annotation
HeliconiusHMEL0125240.093.37% 
BombyxBGIBMGA006601-TA0.092.63% 
DrosophilaCG14762-PB1e-17364.11% 
EBI UniRef50UniRef50_D0Z7562e-17164.11%MIP14966p n=24 Tax=Neoptera RepID=D0Z756_DROME
NCBI RefSeqXP_970465.10.068.87%PREDICTED: similar to slit protein [Tribolium castaneum]
NCBI nr blastpgi|910898430.068.87%PREDICTED: similar to slit protein [Tribolium castaneum]
NCBI nr blastxgi|2700136420.068.87%hypothetical protein TcasGA2_TC012268 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL14556 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212109-TA
ATGGACGATAAGAACCTGATAGATACGGCTGTTTCTATAACATTTGTAATTTTTTTGTTATTCACGTTAGGGTGCAATCTGATGGAGAGGTATCCAGAGGATGCCTCAGAACATGACTTAGTGCCCCATGAGCGCGGCACGCCACTCACCGATACGTCACATACATATTTCAAGATGGTGCCTCGTTGGTTTGTGGTCTGCTGCGTCGTGCTGTTCGCGAGGCCGGTACACCCTCAAGGCGCTCAGCAATGCCCCCCTCCAAACACCATAACACCCTGCTCTTGTACCGTCAAGAAGAATGGATTGGACATTTTGTGTGAATTCACCGAGCAGCAACATATTCAGAAGGCTATGAACACGCTCCGATCGAAGCCCATGATAATATTTTACTTGAAGCTCCGCCACAACAATCTAGCAAAGTTGCCGTCATACATATTTCTGAGCTTGGATATACGCCACCTAACTGTTCACAACAGTAGTTTAGCTGTTATTGAAGACTCTTCTTTGAGTTCTATCGGCAACAAGTTAACACAGCTTGACGTGTCCCAAAACAATCTGGCGACAATTCCGACATCGTCCTTCACGCATCTGAATCACCTGCTTATACTAAACATGAATCACAACAAGGTGACTGCTATTCACAACAAAGCATTTCTGGGTCTGGATACGTTGGAAATACTAACTTTATACGAAAATAGAATATCTGTCGTCGACGGCGAGGCTTTTAAGGGCTTAGAAAAGAAGCTGAAACGGTTAAATCTTGGAGGAAACGAATTAACAGCGGTGCCACAAAAGGCTTTAGCTCTGCTCGAGAATTTAAAGAAACTTGAAATGCAGGAAAACCGTATTACGTCCATTTCAGAAGGCGATTTTGCAGGCCTACGTAATCTGGATTCCCTCGGGCTGGCCCACAACCAGCTACGTGAAGTACCAGCAAACGTATTTTCTCATTTGACATTCCTCAACTCTCTGGAGTTAGAAGGCAATCTCATTCAGCGGATCGATGAGAAAGCTTTCGCTGGACTGGAAGAAAACTTACAATACCTTCGGCTCGGAGATAATCGTCTCCACGAGATCCCATCGGAGGCCCTGCGCCCTCTACATCGCCTACGTCATTTGGACCTTCGCTCGAATAACATTACTTACATCAGTGAAGACGCTTTCACTGGATACGGCGACTCAATTACTTTCCTGAACCTCCAAAAGAACATGATACATCAATTGCCTACAATGGGTTTCGATAACTTGAACTCTCTCGAGACTCTCAACCTGCAGAACAACAAGTTGCAACATATTCCCGAAGAGATTATGGAGCCGATACTCGATACTTTGCGTGTTGTGGATATTATGGATAATCCTCTCATTTGCGACTGCGACCTAGCTTGGTACGAATCCTGGCTATCTGGTCTCCGTGACCGTGACGACGAGATGATGCAGAAGAAACGCACCATTTGTACAATGGCTAGCGAGCACCGGGAGTATAGCGTTGCCAAGATGCCATTAGAGAAAATGAGCTGCAAGCGAAAGCCCGGTTACGGTAGACCTTCCAGCGCAACAAATATCTCCCCAATTTTGGCTAATGTTATAACCGCTGTCGTCGCTAGATGGCTATAA

Protein sequence:

>DPOGS212109-PA
MDDKNLIDTAVSITFVIFLLFTLGCNLMERYPEDASEHDLVPHERGTPLTDTSHTYFKMVPRWFVVCCVVLFARPVHPQGAQQCPPPNTITPCSCTVKKNGLDILCEFTEQQHIQKAMNTLRSKPMIIFYLKLRHNNLAKLPSYIFLSLDIRHLTVHNSSLAVIEDSSLSSIGNKLTQLDVSQNNLATIPTSSFTHLNHLLILNMNHNKVTAIHNKAFLGLDTLEILTLYENRISVVDGEAFKGLEKKLKRLNLGGNELTAVPQKALALLENLKKLEMQENRITSISEGDFAGLRNLDSLGLAHNQLREVPANVFSHLTFLNSLELEGNLIQRIDEKAFAGLEENLQYLRLGDNRLHEIPSEALRPLHRLRHLDLRSNNITYISEDAFTGYGDSITFLNLQKNMIHQLPTMGFDNLNSLETLNLQNNKLQHIPEEIMEPILDTLRVVDIMDNPLICDCDLAWYESWLSGLRDRDDEMMQKKRTICTMASEHREYSVAKMPLEKMSCKRKPGYGRPSSATNISPILANVITAVVARWL-