Monarch geneset OGS2.0

DPOGS202880
Transcript: DPOGS202880-TA (1593 bp)
Protein: DPOGS202880-PA (530 aa)
Genomic position: DPSCF300126 - 430511-433128
RNAseq coverage: 426x (Rank: top 29%)
Annotation
Source          Hit                E-value  Identity  Description
Heliconius      HMEL014582         2e-39    33.66%
Bombyx          BGIBMGA004155-TA   3e-25    26.89%
Drosophila      (none listed)
EBI UniRef50    UniRef50_B0XEF2    3e-07    26.90%    Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0XEF2_CULQU
NCBI RefSeq     XP_001868024.1     5e-08    26.90%    conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp  gi|170065641       1e-06    26.90%    conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastx  gi|157132434       4e-07    24.23%    hypothetical protein AaeL_AAEL012427 [Aedes aegypti]
Group
KEGG pathway: (none listed)
Orthology group: MCL21016 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202880-TA
ATGGCGTCCCCTGATTCTATACATTTTGATGAAACACCAGTAAACACCCAACTAGCTGAGGAGTACTTGAAAACTAAATTAGAAATACATCCCAGAGGATATGGAGAAAACGGTTCTTATGAGATGAATGCATTCAATAATAAAATATTCTTTTCCGACATTGATACAGAACCGGATAAGCTTGAAAATAAAGAAAAAACAGCGAGTAACTTATCAGCCTTGTACAAGTTAATTAAAACAAATGCAAATTTATTTTGCCAATTGGAATACCTGATCCGTGGGGTTCCTGTGAAGGATGTGGAGAGATATGATCTGAAATTGATTTTTAACAACCTCTTATATTGTAATCAGGAAAAAAAGAATTTTGTAATTTATAACGATGTTTATTGTGATAAATTAGCATTTCTGTCGGATACACCATTAGCTGTACTTGGTTTTGTCGGCCAGTCACTTCTGTGTCATCTTTTAAATAAGACTAAGATAATTACAATAATATGCGACAAACACACTGCAGTCATTGGCTCGCTGTTTGTAGACCTGTGCCACGAAGTTGGTATTTTGCAAGTGAAGCTGCTTATAGTACCAGACGACACCGACTGCTCTAAGTATGATTTGTTGAAAGTGTCAGAACTTAGTAGCGGCTGTGTTGGAGTTGTCAGCAACAGAAGCGATGTTGATTCAGCTGTTGATACATTCTTGTCCTCTTCCACCTGGTATCCCTGGAGAATAAAGAAGGTTTTCATTCAAGAATCTGCATTGAAGAGATTTACGTCAGCAATGAAATGGAAAACGAGAGGTAGGGCGAGTGACGTCACGTCCTGTGAAATCTCCTTCAGTCGGGAGGAAAAGCTGTTCGTGCTGGAACCGGGCAGCGTGCGTGACGACTCTCATCAGCTTATCGTACTAGAAGCCTATAGAACAGTGAAGGAACTGCTGAATCTACTTCAAAACGAAAAGCCATTCGCTATCTCGCTCTGGTGCAGTGATTTAGCTGAGACAAACGAGATTTCACACCACGTGGATTCCAACATAGTATGGGTCAACGATCACGCCAACTTCCAAGGACCGCCGCAATCCTCGCAGGCCTTCTATTCGGTCATAGATTTGTTCTACAGCTCTCTGACAATACCACATGTACCGGAGCTGGATCAAATCAAAAAGCTCAGAGAATCCTGGCTGAGACGCAGCCCAGAACAGAGGGAAGCTGTGTTGATGGAAGAGTCACGCAAACATACGTTTTTCACGACCATAATACTCGAGAAACATTTCCAGGACAACTATGTACGCGTCACAAAAAGCAATGTCATAATGGGGACCATTGTTCCCGGGGGAGTCTGGCTGGTGGACTTTTGTAATAAACTCGTCGACATGTCCGTGATCAACTTCGTGATGAAGGGCGGCGGCCTGCTGGTCAACTACATGAGTGATGAGATCGATACATTCTTTAAGAACTTAAATGAAACGTTTGAAGTGCCCGTGGTTTATAAGGAAGAGGAGGCGCAAAGTGATGGACGGAAAGCGAATGTACTATGGAAGCCGTGCTACAAACATAAAGTTATTTGGACCAACTATGGAACTATATTTGCCAACTGA

Protein sequence:

>DPOGS202880-PA
MASPDSIHFDETPVNTQLAEEYLKTKLEIHPRGYGENGSYEMNAFNNKIFFSDIDTEPDKLENKEKTASNLSALYKLIKTNANLFCQLEYLIRGVPVKDVERYDLKLIFNNLLYCNQEKKNFVIYNDVYCDKLAFLSDTPLAVLGFVGQSLLCHLLNKTKIITIICDKHTAVIGSLFVDLCHEVGILQVKLLIVPDDTDCSKYDLLKVSELSSGCVGVVSNRSDVDSAVDTFLSSSTWYPWRIKKVFIQESALKRFTSAMKWKTRGRASDVTSCEISFSREEKLFVLEPGSVRDDSHQLIVLEAYRTVKELLNLLQNEKPFAISLWCSDLAETNEISHHVDSNIVWVNDHANFQGPPQSSQAFYSVIDLFYSSLTIPHVPELDQIKKLRESWLRRSPEQREAVLMEESRKHTFFTTIILEKHFQDNYVRVTKSNVIMGTIVPGGVWLVDFCNKLVDMSVINFVMKGGGLLVNYMSDEIDTFFKNLNETFEVPVVYKEEEAQSDGRKANVLWKPCYKHKVIWTNYGTIFAN-
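
The lengths in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record. It assumes the transcript listed here is coding sequence only and that the standard genetic code (NCBI translation table 1) applies. The function names `translate` and `protein_length` are hypothetical helpers introduced for illustration, and the stop codon is written as "-" to match the protein sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order. Assumption: this table applies to the gene above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def protein_length(transcript_bp: int) -> int:
    """Residue count for a coding-only transcript that ends in one stop codon."""
    return transcript_bp // 3 - 1


# First six codons and last five codons of DPOGS202880-TA, copied from above.
print(translate("ATGGCGTCCCCTGATTCT"))   # MASPDS
print(translate("ATATTTGCCAACTGA"))      # IFAN-
print(protein_length(1593))              # 530
print(433128 - 430511 + 1)               # 2618 (inclusive genomic span in bp)
```

Under these assumptions, 1593 bp corresponds to 531 codons, that is 530 residues plus the terminal stop, which agrees with the 530 aa listed for DPOGS202880-PA. The inclusive genomic span of 2618 bp is longer than the transcript, which is what one would expect if the gene model contains introns.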