Monarch geneset OGS2.0

DPOGS200180
TranscriptDPOGS200180-TA1512 bp
ProteinDPOGS200180-PA503 aa
Genomic positionDPSCF300128 + 656412-670199
RNAseq coverage3114x (Rank: top 4%)
Annotation
HeliconiusHMEL0094420.075.05% 
BombyxBGIBMGA002930-TA4e-17267.92% 
DrosophilaCG30118-PC7e-14657.02% 
EBI UniRef50UniRef50_E2AE061e-15257.09%Kelch repeat domain-containing protein KIAA0265-like protein n=20 Tax=Pancrustacea RepID=E2AE06_CAMFO
NCBI RefSeqXP_393362.22e-16068.62%PREDICTED: similar to CG30118-PA [Apis mellifera]
NCBI nr blastpgi|3320272808e-16058.63%hypothetical protein G5I_04007 [Acromyrmex echinatior]
NCBI nr blastxgi|3320272805e-15458.75%hypothetical protein G5I_04007 [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain[226-397] IPR0235774.8e-22CYTH-like domain
Orthology groupMCL15721 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200180-TA
ATGTGTTTGACAAAAATGGCGAATTCATCCCAAAAAATCGTGTACAAGCTAGTACTAACCGGAGGACCTTGCGGCGGTAAAACCACGGGACAATCCCGTCTCAGCACATTCTTCGAAAACTTGGGATGGAAGGTGTTCCGGGTGCCGGAAACGGCTACCGTCCTCCTAAGTGGCGGCATAAAGTTTGTAGATCTCAGCCCAGATGAAGCAATGAAGTTCCAGGAGAATCTACTGAAGACGATGATACAGATCGAGAACACCTTCTTCGAGTTGGGCAGAACTTGTCAGAGGAACTGCCTCATCATATGCGATAGAGGGGCCATGGACGCTAGCGCATCTGTTAGCTCTAATGGCTTCGAGAGCGTCACCGGGGAGGCCATTAGGGTTATTAATTCCTTCGCATACAGAATTAACAATTTTTCATGCCGTTACCATATAAGTTTCATGTTTTTGACGAAATATCAAGGCCAACAAATTTACAGAGAGCACGAGCACGCGTGTAGGTCAGAAGGCGTGGAGCTGGCTAGGGAATTGGATTACAACGCAGCGGCCGCTTGGATAGGTCACCCATACTTCGACGTCATTGACAATTCTACCGACTTCGACAAGAAGATGAACCGTCTCCTGGCTTGCGTGTGTCAGCGGGTCGGCCTGGACACCGGTGACCGACTGAACGTTAACTCCAAGAAGCGCAAGTTCCTAGTGAAGCCGCCGTTGTTGCCGGACGCAGAGTTCCCTCCATTCCAGGACTTCGACGTGGTCCACAACTATCTTCAGAGCGACGTGCGCAAGGCCCAAGTCAGGCTGCGGAAGCGCGGTCAGAAGGGCCACTGGTCATACATCCACACCGTACGCAAATTCCATCCAACGAACGGCCAGTCTGTGGAAGTCCGCACGCAGCTGACTCACCGCGACTACCTCAACATGCTGCCCCAAAGGGACGACGCTCATTTTACCATCTTCAAAAAGAGACGTTGCTTCATATACAACAACCAATACTACCAGCTCGATATTTACCGACAACCCACACATCCCAGGTGTCGCGGCCTAGTACTTCTGGAGACCTACAGCGGTATGTACGACCAAGATGCTCTGCTCGCATCTCTGCCAATGTTCCTTACTATTGAGAAGGAGGTGACCGGTGACCCTGCTTACTCCATGTACAATTTGTCTCTGAAGGAGGACTGGAAGACCTCCACTAAATACTACTCCGGCTCTGAAGGGCAAACGAAATGTATGAATGGTCACAGTACAGAAAAAGCTAACGGCCATATTAGAAACGGTCATACAGAGAAAGTTAACGGCCATGACGCTAAAGCTAACGGTCACGGGGCGAAGGTCATAGGTCACACGTACAGCAAGATATCGAGCATAGGGGGCGACAGGAACGGTTATCATAAGGAGAACGGCTTTGGGCACATTAACGGTGACGTCACGGATGCTGACTACAACATGAAGAACACCAAGGCCAGATCACCAAAGATACACAAGAAAGATGCTGTTAAAGTATAG

Protein sequence:

>DPOGS200180-PA
MCLTKMANSSQKIVYKLVLTGGPCGGKTTGQSRLSTFFENLGWKVFRVPETATVLLSGGIKFVDLSPDEAMKFQENLLKTMIQIENTFFELGRTCQRNCLIICDRGAMDASASVSSNGFESVTGEAIRVINSFAYRINNFSCRYHISFMFLTKYQGQQIYREHEHACRSEGVELARELDYNAAAAWIGHPYFDVIDNSTDFDKKMNRLLACVCQRVGLDTGDRLNVNSKKRKFLVKPPLLPDAEFPPFQDFDVVHNYLQSDVRKAQVRLRKRGQKGHWSYIHTVRKFHPTNGQSVEVRTQLTHRDYLNMLPQRDDAHFTIFKKRRCFIYNNQYYQLDIYRQPTHPRCRGLVLLETYSGMYDQDALLASLPMFLTIEKEVTGDPAYSMYNLSLKEDWKTSTKYYSGSEGQTKCMNGHSTEKANGHIRNGHTEKVNGHDAKANGHGAKVIGHTYSKISSIGGDRNGYHKENGFGHINGDVTDADYNMKNTKARSPKIHKKDAVKV-