Monarch geneset OGS2.0

DPOGS202476
Transcript: DPOGS202476-TA; 1107 bp
Protein: DPOGS202476-PA; 368 aa
Genomic position: DPSCF300326 + 12640-15352
RNAseq coverage: 1107x (Rank: top 11%)
Annotation
Heliconius: HMEL014746; 2e-16; 31.07%
Bombyx: BGIBMGA011597-TA; 1e-47; 68.90%
Drosophila: CG15097-PC; 4e-18; 27.59%
EBI UniRef50: UniRef50_E0VYN3; 4e-32; 29.62%; Putative uncharacterized protein n=3 Tax=Pediculus humanus corporis RepID=E0VYN3_PEDHC
NCBI RefSeq: XP_001605813.1; 1e-38; 32.78%; PREDICTED: similar to ns1 binding protein [Nasonia vitripennis]
NCBI nr blastp: gi|156546793; 3e-37; 32.78%; PREDICTED: influenza virus NS1A-binding protein-like isoform 1 [Nasonia vitripennis]
NCBI nr blastx: gi|156546793; 1e-34; 33.11%; PREDICTED: influenza virus NS1A-binding protein-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology: GO:0005515; 2.1e-16; protein binding
KEGG pathway: none listed
InterPro domain: [33-160] IPR011333; 2.8e-21; BTB/POZ fold
                 [51-158] IPR013069; 2.1e-16; BTB/POZ
                 [61-160] IPR000210; 8.2e-13; BTB/POZ-like
Orthology group: MCL30854; Lepidoptera specific

Nucleotide sequence:

>DPOGS202476-TA
ATGTCAGCTGTGTATAGGTCGATACTCAGAGCAGGAGGAGCCAACTCGGCCGCCTCGCGCTCCGAGGACAGCGAGGGGGAAGAGGCGACCGCGCTCGAACAGGAGTTGTCTCTGGAGGACCGCGACGCCCCTGCCCGGGTGCTTCACGCCCTCAACGCGCTCCGCAAGGCGCGGCAGCACTACGACGTGCTGCTGACGGCCGCCGGCGAGGAGGTGGCGGCACACCGGGCCGTACTGGCCGCCTCTTCACCGCGCCTCCTTACGCTGCTGGAGCCCTCCTCCCCGGCCGGCGCCTCCGCTCCTGCCGTGCGCGTGCCTGGCGTGGACCCGGACGCGCTGCGTGAGCTTGTGGAGTACGCCTACACTGGCCGCCTGCGGGTAAAGGACGCGTCCTCTGCACGACGCCTCTACCGCGCGGCCGCTGCACTGCGCGTGGAACACGCGCGCTCACACCTCGCCGACCGCCTGCTGAGGCGACTCACACCGCACGACTGTCTATCGCTGAGAGCGCTGCCCGACCTCGCTGACCATCACCGCTCCCAGCTGGACACCTTCATCGAGAAGAATTTTGACGAAATATGTGAAAGTGGTGCTCTGGCCGCGCTGCCGCTGATAAAGATCGAACTGCTCCGTGAGACGAGCGCGGAGGGCGGGGAGGAGGCGCCGGCCGCGGTCGCCGACGCCGCCCTTACCTGGCTGCGGGACCACACTCCCGTCGACGTAGACCTCGAAGAGCTCTGTTCCCGCACACACCTGCTGTTCGTCGACGTCAAGGGAGAGTTGAGAGACTGCGGGGAACTTCCGGCGGCCAGGGGAGACGCCCCCGAGCTGGAGGAATATAGGAGGGAGGCGCGGGAGAGGGAGAGAGGGCCGAGGAGGCGGGGACGGGATGAAGACGAACCCGCTGGGGAGTGTACCGTCATAGCGGCGAGGGCGGGGGTCGGCGGTGGGACACGGGCGGGGGCGGGCGACGAACGGGAGGAGCCGAGGGTCCTCGCGGGGGACGGCTGGGTGGCGCGAGGGACCCGGAGGGGGCGGACACGACGGCGGGAGACACCAGCGCCCACATGTCCGTGGGGCGGTGCGCTCTGGGGGCGGCGGCGCTGA

Protein sequence:

>DPOGS202476-PA
MSAVYRSILRAGGANSAASRSEDSEGEEATALEQELSLEDRDAPARVLHALNALRKARQHYDVLLTAAGEEVAAHRAVLAASSPRLLTLLEPSSPAGASAPAVRVPGVDPDALRELVEYAYTGRLRVKDASSARRLYRAAAALRVEHARSHLADRLLRRLTPHDCLSLRALPDLADHHRSQLDTFIEKNFDEICESGALAALPLIKIELLRETSAEGGEEAPAAVADAALTWLRDHTPVDVDLEELCSRTHLLFVDVKGELRDCGELPAARGDAPELEEYRREARERERGPRRRGRDEDEPAGECTVIAARAGVGGGTRAGAGDEREEPRVLAGDGWVARGTRRGRTRRRETPAPTCPWGGALWGRRR-
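
The lengths in this record agree with each other: 368 residues plus one stop codon is 369 codons, or 1107 bp, and the trailing "-" in the protein sequence corresponds to the final TGA. The Python sketch below is one way to check this; it assumes the standard genetic code (NCBI translation table 1). The helper names `translate` and `expected_cds_length` are illustrative and not part of any database tooling, and the sketch writes the stop codon as "*" rather than "-".

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# First 30 and last 21 bases of DPOGS202476-TA, copied from the record above.
print(translate("ATGTCAGCTGTGTATAGGTCGATACTCAGA"))  # MSAVYRSILR
print(translate("CTCTGGGGGCGGCGGCGCTGA"))           # LWGRRR*
print(expected_cds_length(368))                     # 1107
```

Passing the full nucleotide sequence to `translate` and comparing the result (with "*" replaced by "-") against the protein sequence would extend the same check to the whole record.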