Monarch geneset OGS2.0

DPOGS210213
Transcript: DPOGS210213-TA (1209 bp)
Protein: DPOGS210213-PA (402 aa)
Genomic position: DPSCF300196 - 698767-713388
RNAseq coverage: 155x (Rank: top 53%)
Annotation
Heliconius: HMEL014655 | 1e-15 | 87.80%
Bombyx: BGIBMGA002537-TA | 1e-122 | 75.59%
Drosophila: CG17364-PA | 7e-56 | 43.31%
EBI UniRef50: UniRef50_A7UTJ3 | 3e-63 | 50.85% | AGAP004943-PA n=5 Tax=cellular organisms RepID=A7UTJ3_ANOGA
NCBI RefSeq: XP_393172.2 | 2e-64 | 44.24% | PREDICTED: similar to CG17364-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|383866390 | 1e-63 | 46.02% | PREDICTED: uncharacterized protein LOC100876451 [Megachile rotundata]
NCBI nr blastx: gi|383866390 | 4e-80 | 47.53% | PREDICTED: uncharacterized protein LOC100876451 [Megachile rotundata]
Group
KEGG pathway 
Orthology group: MCL16220 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210213-TA
ATGGAGGTGGTCGCGCTGGTGGTGTGCGCGTTGGTTGTGAGCGCAGCGCCGAACGCCACCCGCCGCCGAGAAGCGCCCGACCACGATGAAGACGAGCTCGACTACTACGATGAAGCCAAGATCAATATATCAGAAAAGGGTATGTATCTCGGATCTCCGTGCGAGTTCACTTGCAACCCCCGTCTGCTTCACGTGTACTGCGAGCCAAGAACCAGCTCGTGCCAGTGTGATCTTAAATACCCCGTGTCGCTCGGGGTCGCCAGAGGCTGCGCTAAGCCAAAGAAACTGGGCGAGCAATGTTTCTACCAGGAGACTTGTCGGGCGTTCGACCCTCACGCATCCTGTGCGCAGATCAATCACAACGCGTACTGCCAGTGCGATGCCGGTTACCACACCACCACACACTCCAGGCCCACCCAAAGAATCTTCTGCACTGAAGACTTGGTACTGCTGACGGCGGATATGCCGACGCTGTTAGGAGTATCGTCGGGCATCTTCGTTCTGGCGGGACTGTTGTGTATGGTGCTACACCTGTACACGAAGGCGCGCTACCATCCGGCACACTTGGCAGACGCCAGGCTCACACCGCCCTGCCTGTACTCGCTTAATGATACCGGAGGTACACTGAGTGCTACCCGCGCCTCGTCCCGAGCGTCTTCTCGTAGTGGCTGCACTAGCGGTACCCTGGAGGAGTCTCGGCGGGAGTCTCGGCGCGGGGCGTCCCGGGCAGGTGCGGCGCGGACGGCGGCCATCTTGCTAATCTCGCGCCACCTGAAGGCCGTAAGGGACGGCGGCGCGCACCCGAAGTGCGCCGCCAAAGGTGATCCGGATGATGTACGCGGCCCTGTATGCACGTTCCGGGCGGACCTCCTCCAACGTTCGAGACGTCCAAGTTTAAGTTCAGTGCAGAGCAGCTCATCCTCGATTAGGAGTTATAGTGCGAAGAAGTGGGAGAGAGAAAGAGAACAGAAAGAGAGACGGCAAATGAGCATGCGCCTGGCACAGCTTCACGACAAGATGACGGCCGGGCGAGATCTCCCAAAACACGCCCCGACTCCGTCGCCACGTTCACCCAACAACTCCACTGATGGGCTGCTTCCAGCTGTATTTAGGAGCAAGTTCAGAGCAACAACTGCAGGGCGTGGACGGGCCTTGCTCGAGTTCCTCGCTTTACTGACAGCAAAGGTCAACGACCATCAGGTCGATTGA

Protein sequence:

>DPOGS210213-PA
MEVVALVVCALVVSAAPNATRRREAPDHDEDELDYYDEAKINISEKGMYLGSPCEFTCNPRLLHVYCEPRTSSCQCDLKYPVSLGVARGCAKPKKLGEQCFYQETCRAFDPHASCAQINHNAYCQCDAGYHTTTHSRPTQRIFCTEDLVLLTADMPTLLGVSSGIFVLAGLLCMVLHLYTKARYHPAHLADARLTPPCLYSLNDTGGTLSATRASSRASSRSGCTSGTLEESRRESRRGASRAGAARTAAILLISRHLKAVRDGGAHPKCAAKGDPDDVRGPVCTFRADLLQRSRRPSLSSVQSSSSSIRSYSAKKWEREREQKERRQMSMRLAQLHDKMTAGRDLPKHAPTPSPRSPNNSTDGLLPAVFRSKFRATTAGRGRALLEFLALLTAKVNDHQVD-