Monarch geneset OGS2.0

DPOGS210519
TranscriptDPOGS210519-TA1317 bp
ProteinDPOGS210519-PA438 aa
Genomic positionDPSCF300186 + 205121-214705
RNAseq coverage283x (Rank: top 39%)
Annotation
HeliconiusHMEL0163366e-14889.63% 
BombyxBGIBMGA012622-TA0.088.43% 
Drosophilapk-PC4e-9874.53% 
EBI UniRef50UniRef50_E0VEW11e-9978.12%LIM domain only protein, putative n=2 Tax=Coelomata RepID=E0VEW1_PEDHC
NCBI RefSeqXP_002424655.12e-10078.12%LIM domain only protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420076754e-9978.12%LIM domain only protein, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2700104371e-13553.45%hypothetical protein TcasGA2_TC009830 [Tribolium castaneum]
Group
Gene OntologyGO:00082708.4e-17zinc ion binding
KEGG pathwayphu:Phum_PHUM1420307e-100 
 K04511 (PRICKLE)maps-> Wnt signaling pathway
InterPro domain[86-138] IPR0017818.4e-17Zinc finger, LIM-type
Orthology groupMCL12744 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210519-TA
ATGGATGACATCTCGCTGAGTCAATTCGACGATACTGACGCTTTTAAATCTCTTGATCCCAAAACACAGCAGTGCGAGGAGACCATGTCTTCAGGTGACATGTGTGTATCGGCGGCCCGAGCCGGTCCTTCGGCGAGATGGCATCCCTCCTGCTTCGTGTGCAGCACCTGCCAGGAGCTGCTGGTGGACCTGGTGTACTTCTGGAAAGACGGACGACTATACTGTGGACGGCATCACGCTGAAACACTAAAGCCCAGATGTTCTGCCTGCGATGAGATAATATTGGCTGATGAGTGCACGGAAGCTGAGGGTCGAGCGTGGCATATGAAACACTTCGCGTGTCAGGAATGTTCACGGCAACTGGGCGGACAGCGGTACATCATGAGGGAAGCCAGGCCGTACTGCCTGCCTTGCTTCGACAATTGCTTCGCCGAATACTGCGACGCCTGCGGAGAGCCCATCGGCGTAGATCAAGGTCAGATGTCTCATGAGGGCCAGCACTGGCACGCCACCGAGCGGTGCTTCGCATGTCATACGTGTAGAGCCAGTCTTCTCGGGCGTCCGTTCTTACCTCGCAAGGGAGCTATATTTTGTTCTATAGCCTGCTCCAAAGGCGAACCACCAACGCCGTCAGATTCGTCAGGTCCTGGACCGAGACCACCACGCGTGCCAAGGCCAAGAAGATTGCCTTCGCCGAAATCACCAGAGAGAACTCCACCGAGGGAAGCCTCTCCCAACATAGATGCAGATTCGGAACCGAGCGCCTCTCTCGCCACTGAATCACCGCCGCCCCACCCTCCGCCGCCGCCACATCCTTGCCTGAGCTTAGACAGAGCGTTGGCTGATTTAAAGTTAGAACAATCGATAACCGAGCACCAAATACCACCGACGCCAACCTCAGATGAAGTCCAAGAGGCCCCGGAATCCAATTGGAAACCACCTCATCCTGTGGAAGCTCAAACGGTTATAACACCCGGACACTCTACGTCAATGCCCGAATTGACGCTCGTAGATGGTAGGAGTCAGAAAAAGCCCCGAGGAGGCCGAACAGTTCGATTCTGTGGTGACGACAACGAAGCCTTCGAACCTGATGAACCTGAACGGAAAGACAAAAGTAAAGCAAGAGATGACGACGCCAGCAGTTACTGTAGCACGTGTTCTTCATCTTCCTCGTCCACCGAATCATACACTCTGCCTACTAGAAGAGCGTACGGCGGAGTCAGGATATCGTACGTGCCAAACGACGCCGTCGCTTGCGCAAAGAGGGAACGACAGAGAAAGAACAATCAACCAGATAAGAATTGTATCATCTCGTGA

Protein sequence:

>DPOGS210519-PA
MDDISLSQFDDTDAFKSLDPKTQQCEETMSSGDMCVSAARAGPSARWHPSCFVCSTCQELLVDLVYFWKDGRLYCGRHHAETLKPRCSACDEIILADECTEAEGRAWHMKHFACQECSRQLGGQRYIMREARPYCLPCFDNCFAEYCDACGEPIGVDQGQMSHEGQHWHATERCFACHTCRASLLGRPFLPRKGAIFCSIACSKGEPPTPSDSSGPGPRPPRVPRPRRLPSPKSPERTPPREASPNIDADSEPSASLATESPPPHPPPPPHPCLSLDRALADLKLEQSITEHQIPPTPTSDEVQEAPESNWKPPHPVEAQTVITPGHSTSMPELTLVDGRSQKKPRGGRTVRFCGDDNEAFEPDEPERKDKSKARDDDASSYCSTCSSSSSSTESYTLPTRRAYGGVRISYVPNDAVACAKRERQRKNNQPDKNCIIS-