Monarch geneset OGS2.0

DPOGS208796
TranscriptDPOGS208796-TA1176 bp
ProteinDPOGS208796-PA391 aa
Genomic positionDPSCF300036 - 404805-430942
RNAseq coverage3182x (Rank: top 4%)
Annotation
HeliconiusHMEL0128189e-12688.21% 
BombyxBGIBMGA007658-TA4e-1260.00% 
DrosophilaCG8180-PA1e-1625.66% 
EBI UniRef50UniRef50_E0VMV48e-2734.08%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VMV4_PEDHC
NCBI RefSeqXP_002427448.11e-2734.08%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|2420135113e-2634.08%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastxgi|2420135111e-2733.70%conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene OntologyGO:00055158.2e-08protein binding
KEGG pathway 
InterPro domain[193-234] IPR0021728.2e-08Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL20466 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208796-TA
ATGTTGACCGGTTTCTTAGTGTTTTCATTAATTTCACTTCAAACCTTTCAATTTGGTTATGCCTCCAGCCTGCTGTGTGGGCGCACCAAGGAAGTGGAGGTGGGTAGCAACGTCGGCGCAGGCGCGGCGCTGGCGCTGACCTTACTGCGACCACAAGCAGCTACAACCAGCGGCCCTTGCCAGCTGAAGCTGAACGCACCGGACGCAGCTGCCTTTACCGTCCGGCTTATTGATGTCAAGGAATCTCTGTCTGATTGGGAGTGGAGTCAGTACGAAGGTCCGGGCGCTCGCAAGTGGAGCGGCCGCAGCGCCCCGGCCCAGGACGCAGTCGACGACAACACTCGCATCTCAGTCGCAAATAACACTGACTCTTGTAAACTACTCGTTTATATAAGCGACACGAAGACACCGATATGGCGTTTGTCCCTGTGCGGAGGAAACGCAGCGGCTGTGGCCGCGAAGGCTGGAACCAAATTGCTGCCACCAAAGATAAAGCTAGTATGGACACCTCCCACAACCACCAACCACCACACGGAGAAACTAAGGCTGGTTGTCACCGCTGTTAATAGTGGTTCTGTATGTAACAATGAATCTCAGTTCGTGTGCGGCGTCACAAGTCTCTGTATATCTTCATCCTTGGTGTGCGATGGTGTGAAGCACTGTCCGGGCGGTGAGGACGAAGATGGCAGTGCATGTTCACATCGCAGGGACTCTCCACTGCTGGAGATGTTGAGACGGTTCGCGGCCAGGAACCAGGAGTTCCTGGGTTTGGACCAGCCAGATGGTGTGACCAAGCCGTCGGTTATAATGAAGATAACGGAGGGGGAGCAAAAACAGAATGCGTTTATGGAGTTTGCAGCGGCCCTGAAGCCTTATGGACCCTGGAGCTACCTCGTGGTGGGAATGTTGGTCTGCGCTACCATACTCATGTTCTGTCTTGCTTGGGAATGTTGCTGCAAGCGTTCCAAGCCTTCAGACACTCCTATCAACATACCGGCCTCTTGCATCGACCTGTCACCGACGGTCACGGTCACGGCGTCCTCTCAGCAGCTGTTCGAGCCGGCGCCTTCGCCCCCCGAGTACGAGCCCCCTCCATCATACTCCTCTCTCTTCCCACGAGCCTACAAGTCCTCCCCGTCCCCCGTCCCACACTGCTCCCACCAGGAACCCGACTGA

Protein sequence:

>DPOGS208796-PA
MLTGFLVFSLISLQTFQFGYASSLLCGRTKEVEVGSNVGAGAALALTLLRPQAATTSGPCQLKLNAPDAAAFTVRLIDVKESLSDWEWSQYEGPGARKWSGRSAPAQDAVDDNTRISVANNTDSCKLLVYISDTKTPIWRLSLCGGNAAAVAAKAGTKLLPPKIKLVWTPPTTTNHHTEKLRLVVTAVNSGSVCNNESQFVCGVTSLCISSSLVCDGVKHCPGGEDEDGSACSHRRDSPLLEMLRRFAARNQEFLGLDQPDGVTKPSVIMKITEGEQKQNAFMEFAAALKPYGPWSYLVVGMLVCATILMFCLAWECCCKRSKPSDTPINIPASCIDLSPTVTVTASSQQLFEPAPSPPEYEPPPSYSSLFPRAYKSSPSPVPHCSHQEPD-