Monarch geneset OGS2.0

DPOGS202914
TranscriptDPOGS202914-TA1692 bp
ProteinDPOGS202914-PA563 aa
Genomic positionDPSCF300126 + 376861-382137
RNAseq coverage530x (Rank: top 24%)
Annotation
HeliconiusHMEL0145910.066.05% 
BombyxBGIBMGA004195-TA2e-15452.50% 
Drosophila% 
EBI UniRef50UniRef50_D6WR175e-1230.13%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WR17_TRICA
NCBI RefSeqNP_001135924.15e-1330.13%hypothetical protein LOC100141786 [Tribolium castaneum]
NCBI nr blastpgi|2154901061e-1130.13%uncharacterized protein LOC100141786 [Tribolium castaneum]
NCBI nr blastxgi|3287044656e-1225.00%PREDICTED: hypothetical protein LOC100168953 [Acyrthosiphon pisum]
Group
Gene OntologyGO:00055151.7e-10protein binding
KEGG pathway 
InterPro domain[1-111] IPR0119931.7e-10Pleckstrin homology-type
Orthology groupMCL24997 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202914-TA
ATGTCTGGTTACCTTGAAGTGAAATATCCGTTTAGATCGAACCTTGGTTTGAATCCTTTCAAGTCTTGGAAGAGACAATGGTGTATATTGCGGCCTAGTCCCACGACTGTGGGAGGTTCTCTGGCCGTGTACTGCAGTGAGGCTGGGGCTGCAGCGGGCACCGTCGAGTTGCGATCAGGTTGCACTGTGAAACGTGCCAAGTCTCGCACAAGGCCCCACGCTTTCGCTGTGTTTTCCGTCGGAGAACCCTGCAAGCCTCGCATCCTGTTGGCAGCACAGACTCTCCAGGAAGCCCAACAGTGGATGGACAAAATTCGTGACTTACTGAATGGCGAGAAACTATTAGGCACTGAAACATTATTAAAAGATTCCTATACTGTGACAATTATACCCACGGCATTCTCTGATAAGTGCGGTATAACAAACGAGTGCCATGTGACGCTGTCCCCTAATGGTCTCCAGCTGTGCTGTCCGCCCACCGACACAGTTATACATTGGCAGAACATTACAGAAGTTCTGCATACAAGAGAAACCGGGGACAAGAATAGAATTTGTACCCTTAATATTCATAGCGAGTCTCAAGGCGGCGGTTGTGTGCGCATGCGCGGCGCCGCGGCCGGTGAGTTGGCGGGCGCGGTCCGCGGGACGCTGAGGGATCGCGCGCGAGCGAGACTCAGTCGCAGCCAGCCAGAGCTCACTTCAGCGTGTCTTAACGCCGCGGATATTCGTCGCAGCAGTTGGTACAGCGGGCCGTCGGAAATATCACTGGACGACACTGACCTCATAATGTCCAAGGAGGCTCAGCACATTCCTTCAAGTCAACTGTCTCGTTGTAGTGGAGCTGGCGACCTCGCCTCCCGCGCTCGGAGGCTCCTCAGGACCTCGGCCGATGACAGCGTGAGTTCCCGGTCCTTGGCGTCTCTCGCGTCGCTGGTGTCGTCGTCATCGGGCGTGTACGAGGAGATAGCGGAGGAGGAGCATACGTACGAGGCGATAGGACTGTACGGAACCGCGCGCCGACCGAAGCGACATCCGCCGCCACTACCGCCTCGCCAACCATACTGTACCCTGAACCGCGGTCAGTCGTGGCGCGAGGCGGAGAAGGTGACCAGACACTCCTCACTCGGCTCACTCACACACAAACAGAGACACCATAAGACATTCAGTGTCTTCAGAAAAAGACTGAAGAGTGACTCCCGCATAGCGACATCACCGAAATCAGAGACCAAAGACAAAGACGTGGAGACGAAGAAGAAAAAGTTCGACTTCACGCCCACACGAGACATATTCAAGAGCTTTAAAGTGAGCCGCAAGATGAAGAACCTGAAGATAACGTCCGGCCTGGCCAAGGGAGAGACCAAGAGCTGCGAGTTCCTGGACGAGGCGCAGCACGTGACGAGCAACAGGTGCTCCAAGTCTGTGGAGTGTTTGGAAGACACCTACGAGTTCCTCGACGACTACCACGGAAACCTGTCCCTGAGTGACGCCGACGAGGCGCTGGCTCTGCCGCAAGAGATCGTCGAGTTGATACTGCGCGGCCCGGACCTGAAAGTTAGATTAAAAGACACTCAGTGCGAGAGCGACTACGTCCCCATGTCGCCCATAGTGCCCATACTGCCCATGGTGCCCATGGTGCCGCCGCCCATAGAACATCACTATATGGTGATGTCGCCCCGCACTAATATAGCTTGA

Protein sequence:

>DPOGS202914-PA
MSGYLEVKYPFRSNLGLNPFKSWKRQWCILRPSPTTVGGSLAVYCSEAGAAAGTVELRSGCTVKRAKSRTRPHAFAVFSVGEPCKPRILLAAQTLQEAQQWMDKIRDLLNGEKLLGTETLLKDSYTVTIIPTAFSDKCGITNECHVTLSPNGLQLCCPPTDTVIHWQNITEVLHTRETGDKNRICTLNIHSESQGGGCVRMRGAAAGELAGAVRGTLRDRARARLSRSQPELTSACLNAADIRRSSWYSGPSEISLDDTDLIMSKEAQHIPSSQLSRCSGAGDLASRARRLLRTSADDSVSSRSLASLASLVSSSSGVYEEIAEEEHTYEAIGLYGTARRPKRHPPPLPPRQPYCTLNRGQSWREAEKVTRHSSLGSLTHKQRHHKTFSVFRKRLKSDSRIATSPKSETKDKDVETKKKKFDFTPTRDIFKSFKVSRKMKNLKITSGLAKGETKSCEFLDEAQHVTSNRCSKSVECLEDTYEFLDDYHGNLSLSDADEALALPQEIVELILRGPDLKVRLKDTQCESDYVPMSPIVPILPMVPMVPPPIEHHYMVMSPRTNIA-