Monarch geneset OGS2.0

DPOGS207558
TranscriptDPOGS207558-TA1518 bp
ProteinDPOGS207558-PA505 aa
Genomic positionDPSCF300072 - 827154-833624
RNAseq coverage611x (Rank: top 21%)
Annotation
HeliconiusHMEL0180216e-5963.59% 
BombyxBGIBMGA004704-TA1e-4540.26% 
Drosophiladlg1-PM7e-0842.19% 
EBI UniRef50UniRef50_E4Y5P84e-1633.68%Whole genome shotgun assembly, allelic scaffold set, scaffold scaffoldA_14 n=2 Tax=Oikopleura dioica RepID=E4Y5P8_OIKDI
NCBI RefSeqXP_970444.22e-2835.71%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|2700076794e-2835.71%hypothetical protein TcasGA2_TC014370 [Tribolium castaneum]
NCBI nr blastxgi|1892374793e-2635.66%PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
Gene OntologyGO:00055154.2e-14protein binding
KEGG pathway 
InterPro domain[91-158] IPR0014784.2e-14PDZ/DHR/GLGF
Orthology groupMCL25056 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207558-TA
ATGCGAGCGGAATTAGTTCAAGCTCGGGCTGCAGGCGCTTGGCAGGCGGGCGGGGACGCGGCGGGTGGTGCGCCCGAGGCTGCGAGGGCGAGGGCCGAAGCCGCACAGCTTGCCAGGGAAAACGCTGCGCTTAGAGAAACTGTGCTCGCGTTGCATTCGGAACTGTTCGGAGCTAGACTAGCAACTAAGTACCTCGATAAAGAACTAGCGGGTAGAATACAACAATTACAACTGTTAGGTAGTGAAATGCGCGCCGAGCTTAGGGACTCGTTGTGGGCCCAGGGTGGTCGTGAGCATGGCGTCCCTATCCTGATCTCGGAGCTAGAGGCTGGCGGACCTGCCGCGCTCACGGGAGACCTCTATGTAGGGGACGCCATCCTAGCTATCAACGATGTTGACCTCACACAGGCTTGTCATAAAGACGCGGTGGAGGCCCTGCAGAGTGTGAAGGGCGACTGTGCTCTCTGTGTACAGTTCATAGCCACTGACGAGGAGGATCGACTCTCAGACGACAACTACAGGTTCGCGCTGTATCCTGAAGAAGACGGTTTCGGAGAGGACGGAGAGGAAGGGGATGCGGCCACGCCCACCGCGCCCAGGACACCGGACTACACACGTAGCGTGTCGTGTAGTGAGGACGAGCACGACATGGCGGTGCAGGGCGGCGGCGAGGCGGGCGCGGCGCTGGCCGGGGCCTCGTACACGGACTACAACATCAACAACCTCTCTATACTGGACATGGACGATGATTCTATCAAGATAGCGCGCCTAGAGATGTCTGGTCGACCATCATCGCTGCCATCGTCCCGCTCCACGCCGGTACACTCGAGGGTCACCGCTAAGAAGAAGAGAAAACCACAGGACTGGCGTACTAAAGGGGCCTACTGTGCTGTATCGGACGCACAATCGTCAGACGACACGCGCGACACTCCCCGACATTATCAGAGTATTCCCTCTCAAGATTTCTACCAATCCAAGTCTGAATACCAGCCTATAGAGACGCCTGTCGGTGATACCTCATGCGGTGTTACTGCTGACGTCACCCACGTGACGACCACCACCGACACCACCGTTACCACCGTCACCGTCAACGACCTACCACCAGCCACCAACAATAACATACCACCCATGAACAACGCAGCTCTCTCCGACGACTTAATCAACGACAACACGTCGTCTCTATCCAACATATCATCGACGACTAACGTGTCACCTAATAATAGTGAAGTGAAGGCTGTCACGTCATCAGTCCCGAAACAAACGACGTTCCACGACACTCAGACGCCAGCACACGAACGTCATCACGCCAGAGTGAACGGCGATAGACAGAGGAGAGAGGTGAAGAGCTTCCGGATAGGTTCAGCGCGCCGCGTCACTGAGAGTGGGATAAGGAACGGAGAGGTGGAGCGACACCCTCACGACTCGTACCCTCACGCTCAGTCACACCCTCACCCTCACCCACCCGGGCCGCGCCTACTGCCGGGTCGAGGAGACCCTGACTTTGGCACACCTGTGTGA

Protein sequence:

>DPOGS207558-PA
MRAELVQARAAGAWQAGGDAAGGAPEAARARAEAAQLARENAALRETVLALHSELFGARLATKYLDKELAGRIQQLQLLGSEMRAELRDSLWAQGGREHGVPILISELEAGGPAALTGDLYVGDAILAINDVDLTQACHKDAVEALQSVKGDCALCVQFIATDEEDRLSDDNYRFALYPEEDGFGEDGEEGDAATPTAPRTPDYTRSVSCSEDEHDMAVQGGGEAGAALAGASYTDYNINNLSILDMDDDSIKIARLEMSGRPSSLPSSRSTPVHSRVTAKKKRKPQDWRTKGAYCAVSDAQSSDDTRDTPRHYQSIPSQDFYQSKSEYQPIETPVGDTSCGVTADVTHVTTTTDTTVTTVTVNDLPPATNNNIPPMNNAALSDDLINDNTSSLSNISSTTNVSPNNSEVKAVTSSVPKQTTFHDTQTPAHERHHARVNGDRQRREVKSFRIGSARRVTESGIRNGEVERHPHDSYPHAQSHPHPHPPGPRLLPGRGDPDFGTPV-