Monarch geneset OGS2.0

DPOGS204660
TranscriptDPOGS204660-TA1824 bp
ProteinDPOGS204660-PA607 aa
Genomic positionDPSCF300170 - 461149-468369
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0082480.069.03% 
BombyxBGIBMGA007463-TA0.058.51% 
Drosophila% 
EBI UniRef50UniRef50_E2BB008e-3530.61%Putative uncharacterized protein n=4 Tax=Formicidae RepID=E2BB00_HARSA
NCBI RefSeqXP_001121835.12e-3228.82%PREDICTED: hypothetical protein [Apis mellifera]
NCBI nr blastpgi|3072106683e-3430.61%hypothetical protein EAI_06767 [Harpegnathos saltator]
NCBI nr blastxgi|3071794909e-3627.48%hypothetical protein EAG_07302 [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[516-603] IPR0009384.8e-11Cytoskeleton-associated protein, Gly-rich domain
Orthology groupMCL22158 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204660-TA
ATGGCGGTTCCTAATGAAGCAGAGGTTAAGTCTAAGTCGGAAAAGTTTGACGTTCAGGTCAGGATCACTTCAGAAGACAGTCTCGTGGTGAAGGAGAGCATAGATCGTGATAGGAGGAAGGACACCCTGAATTTGGACGTCGATAATTTAAGTCTACGATCGTTGTCTGAAGGTGACAATTCCGTTTTCTACGAGGGTGGAACAACGCCGCAGGATCCAAATTCCGCGCCCGATGATAAACAGGAACAGGAAAACGTTAGCGGCAATGAATCGGAAGAAGTCGATGATATAGAACTCATCTTCACGACGGATGAGTCCAAGGATATAAGCAATTTACAGGAAGACCTGGTTTCTATACAGGAAACCGATAATTGGACACCAGCGTTATCTTCGTCGACCCATTCCACACCAGTTCTTATCAAGTTCCACACTTTAGACCCTGACTTCCAACCGGGCAGCAAAGCTCATGAAGTTAAACCAGAAATTCGGGAACATTCCATCCAACGAACTGCTACAGAATCTTCGTTAAGGATGAAGAGGATTTCCCTCCCTAGCGACAAGGACATACGTCACGTGGGTTTCAAGGAAAAAGGAACCCTAGAACCACCAAGTCGTGGTATCTTAAAGAGCTACGACAACAGATCAGATAATTTATTATACAGGAAAAGTCCGAATACAAGTACTAGAAGGTTGGACAGCGTGGACAGTTTGACGAGCGAATACCGCGGGCTCAGCTTCGATAACACCAAGTCCTCGAGTTTCGAGCTGGGTTCAAGTTTGGATGTGCTGCATAGAGACGAAAGCATCAGCAATTTCAATAGAAACAGAATGGGTCATCGGTATTCTGTGTTTGCGGCCACAGATATATCAAAGTGCGGCACGTCCCAGGACGATCTGGCGAGTAATTTGAATGCGAGAAGGAATACGTGCCCAAATCCGTTCCAGTACGGCCTATCCCGTGGTCTACGAGGTCGCAAACCGGTGCGTACCCGCGTGCTACACCAGGGCTTCGCCCCTCGACGTGAGAGCGCGGCCCAGACCGACTTATCAGCTCTACCACCAAGGTGGACTTCGGATGGATACCTGGCATATAAGACATGCGTGTCTCCAGGGGCGGCGCCGACGTTGCCTCAACGAACGACACCCCGGCGACCGCTGTTAAGTGACCTCGGCTTCACTTCGATGGTTCCGGAGCTGTCTCGTTCGGCGGAGCCTCTGTGGGTTCGTAGAGTCCCCCCGTCGCCTTGCGTGCCCTCATGTGGACCTACCCTGCACCCCCCATCAGGGTTGGAACCGCCACGGTCACCATTGTTTCGTTACCGTTACCGTTCGCCTAACATCAGTCTCGACAGAAATACGGAATGGACTCCACCAACATACGCCACACCTAAAGGTGTATGGCGCGGATCGCTGCCGGATGTGCGTCACGACGACACAGACGAACTACTGCGAGACACTGAAGTATATCTACGACGGTCTATCGATAATTTGCGGTCTACTTCACTTGAAGCAGTGAACTGTAAGGACACTCCGGGACAACCTTACATCCCATCAGAAGCTCGTCACCTCCGTTTAGGACACGCTGTCAAATTAATCACCAGCACTGGCCGGCTGGCCGTGGGAAGGGTCAGATACGTCGGCCTGGCTGGTGGGACTGCCGCTAACAGCAGTGTAGTGGTGGGGGCGGAGTTTGCCCTCAATCAATACCCGGGGATTCCCCTCAACGACGGTACCTACAATGGCCGCAAATATTTCGTGCCCCAAGCCCATCACACCGCGCTATTTGTACCGTTCTCTAAAGTGGTCATGGCGTGGGCAAATTAA

Protein sequence:

>DPOGS204660-PA
MAVPNEAEVKSKSEKFDVQVRITSEDSLVVKESIDRDRRKDTLNLDVDNLSLRSLSEGDNSVFYEGGTTPQDPNSAPDDKQEQENVSGNESEEVDDIELIFTTDESKDISNLQEDLVSIQETDNWTPALSSSTHSTPVLIKFHTLDPDFQPGSKAHEVKPEIREHSIQRTATESSLRMKRISLPSDKDIRHVGFKEKGTLEPPSRGILKSYDNRSDNLLYRKSPNTSTRRLDSVDSLTSEYRGLSFDNTKSSSFELGSSLDVLHRDESISNFNRNRMGHRYSVFAATDISKCGTSQDDLASNLNARRNTCPNPFQYGLSRGLRGRKPVRTRVLHQGFAPRRESAAQTDLSALPPRWTSDGYLAYKTCVSPGAAPTLPQRTTPRRPLLSDLGFTSMVPELSRSAEPLWVRRVPPSPCVPSCGPTLHPPSGLEPPRSPLFRYRYRSPNISLDRNTEWTPPTYATPKGVWRGSLPDVRHDDTDELLRDTEVYLRRSIDNLRSTSLEAVNCKDTPGQPYIPSEARHLRLGHAVKLITSTGRLAVGRVRYVGLAGGTAANSSVVVGAEFALNQYPGIPLNDGTYNGRKYFVPQAHHTALFVPFSKVVMAWAN-