Monarch geneset OGS2.0

DPOGS215069
TranscriptDPOGS215069-TA2079 bp
ProteinDPOGS215069-PA692 aa
Genomic positionDPSCF300208 + 564536-569806
RNAseq coverage787x (Rank: top 16%)
Annotation
HeliconiusHMEL0000080.077.35% 
BombyxBGIBMGA005548-TA6e-7774.88% 
DrosophilaCG5098-PB9e-1944.90% 
EBI UniRef50UniRef50_C9S2660.077.21%HM00008 protein n=35 Tax=Nymphalidae RepID=C9S266_9NEOP
NCBI RefSeqXP_001122946.13e-2627.75%PREDICTED: similar to CG5098-PA, partial [Apis mellifera]
NCBI nr blastpgi|2613359320.077.21%HM00008 [Heliconius melpomene]
NCBI nr blastxgi|2613359320.078.18%HM00008 [Heliconius melpomene]
Group
KEGG pathway 
Orthology groupMCL25145 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215069-TA
ATGTCCGGTGGCTCGCAACACAACCCTAATGGAAGACAATCGCAACTGGGAGCGGGTTGGAGCCCATTACAGTTACAGGTGGCCAATCTGCTGGCACGAGCTCCTCAAGCGCACATGGCTTCGGAGAGGGGTCTGACTTGGCACACACCAACTCCACCACCACATTTATACCATGTACCAGCCCCATCACCACCCGATCCGTTACAAGCAATGAAGGGTGCCAACTCGGTGTTCAGACACACAGCCACCCCTCCAGAACTGTTCAGACACACGCCCACACCTCCGGACAAACATTTGATGCGGCACACTCCATCCCCGGGTGACGTCAAGGCCGGAGCTGCACTCCCACCGAGTGATCTGTATCCTCATTTGGGACCAGGGAGGGAAAAACTCCTCCGAGCTGATGAGTTGCTGGCGTACGCGCGTGGTGGGGTCGACCTGACGGTGGGCTCGCGGGGGTCCAACGGTTCTCCCACAATACACTCGGGTCCCGCGCTGTCCGTCCGCGATGCGCCCACAATCAACAGCCTGCTCGCCCGGTCGCGTCCGGCCCACGCGAGGGTGGACTCTCTACTGGAGCGTCTCTCCGCCGCACCCCCCGTCTCCCCCCACTCCGTCATAGTTCACCCCCGCGCCGCTCCCACGCCGTCACCCCCCTCCTCTAATGAGGATTCGGCGGATTCTTGCGGCGCCCCGCCGGGTTCCAAACGGAAACGGAAACCGGAACGCACAATACGTGTTCCTGTACCACCTCCTCCGCCCGCTGAAGTACCAAGACCCGCAACACCGCCTAGAGTTTTACCAGTACCGACACAGCCGTTAGAAAATGGCGACACAGGACACATGACACAAAACGGACCACAGAAACCTCCCACTCCACAGCCACCAGAACCACACGCTAGAAGAAAAACACGATCAGGTTCCGCTGAGACCATAGACGATATAGCGGCCATGATCGCCAGTACAGACTCAGAGCGTCGAGATACAATCCCAGAACCGGCTCCACCTGAACCGGAAGTTCCGGTCACAGTTGATAAACTAAAAAGTGTCTTAGCAAGTCCAGAACCCGAAGTGAAATCGCCGCCAGCCATACAGAACTGCCCGCAGCCGGAGCCGAAATCTCCGAAGAAGTCGCCTAAAAAAGAGGAACCGGAAACGCCTCCCGCTGCGGCCGCGGCGCCGCGGAGGAGAAGCACAAGAACATCAAACACCGAATCAGCTAACGTACCAGTCGAATCACCGTCGGAAGCAAAGCCAGCGGAGCCGGAGTCCAGTGAGGCGAGGAGTGCGGAGCCGACTGCAGCTAGCTTCATGGAAGTTGAGAATCAGTTGGAGAAAATGTTCGCCGGCTTAGAAGAAGCACCGGCCTCCGAGCGAACGGAGAGCGCCAGCGACAACAAACAGACGGCCAAGGCCAGGAAACGGAGAAAATCTGCCCCCACCAAAGGCGAACGGAAGGCGTCCAAGCGCGCGAGCAGGGCTTCCACCGGCGACGAGGCGCCAAGGCGGAAGACGATCAGGAAAAAAATCGACAAAAAGAAAAAGGAATCCGTCAAAGACGCGTACGACTCGGGCAGCAATGCGAGCTCCAGTCGATCGAGAGGACCCTACATACAGATCGTGGGTCCGCGGGATTCCCCTGTTTCCGTGTCAGTCATCAACGCGGCGGCCGGCGAGGAGGAGCGCAGGACTGCGGACGAGTGGAGGAACGGCCTGCGGGCGCGGGGTCTGCACGCGAGCACGCTGGGCCTCCGCTACGACGCCTCCACTCCGGACGCCTCCTGGCTGTGCGCCTTCTGCGAGCGGGGACCTCACCACTCGGGCCTGGGAGACCTATTCGGACCTTATCCCATTGACGTCAACTCAGAGGAGTATAGTACTCTGGACGAGTCATCGAAGCGTCGTTTCACATCGTCGTCCGGAGGTGCGGAGGTGTGGTTTCACGAGGCGTGCGGCGTCTGGGCACCAGGGCTGCTGGCGGCGGGCGCCAGGGCGCCCGCCTCTGGGGGCTGGCGGAGGCCGTCTGCGGAGCCCGGGGCCTCCGCTGCGGGTCCTGCGGTCGGCAGGGCGCCGCCCTAG

Protein sequence:

>DPOGS215069-PA
MSGGSQHNPNGRQSQLGAGWSPLQLQVANLLARAPQAHMASERGLTWHTPTPPPHLYHVPAPSPPDPLQAMKGANSVFRHTATPPELFRHTPTPPDKHLMRHTPSPGDVKAGAALPPSDLYPHLGPGREKLLRADELLAYARGGVDLTVGSRGSNGSPTIHSGPALSVRDAPTINSLLARSRPAHARVDSLLERLSAAPPVSPHSVIVHPRAAPTPSPPSSNEDSADSCGAPPGSKRKRKPERTIRVPVPPPPPAEVPRPATPPRVLPVPTQPLENGDTGHMTQNGPQKPPTPQPPEPHARRKTRSGSAETIDDIAAMIASTDSERRDTIPEPAPPEPEVPVTVDKLKSVLASPEPEVKSPPAIQNCPQPEPKSPKKSPKKEEPETPPAAAAAPRRRSTRTSNTESANVPVESPSEAKPAEPESSEARSAEPTAASFMEVENQLEKMFAGLEEAPASERTESASDNKQTAKARKRRKSAPTKGERKASKRASRASTGDEAPRRKTIRKKIDKKKKESVKDAYDSGSNASSSRSRGPYIQIVGPRDSPVSVSVINAAAGEEERRTADEWRNGLRARGLHASTLGLRYDASTPDASWLCAFCERGPHHSGLGDLFGPYPIDVNSEEYSTLDESSKRRFTSSSGGAEVWFHEACGVWAPGLLAAGARAPASGGWRRPSAEPGASAAGPAVGRAPP-