Monarch geneset OGS2.0

DPOGS200452
TranscriptDPOGS200452-TA1776 bp
ProteinDPOGS200452-PA591 aa
Genomic positionDPSCF300260 - 233175-240418
RNAseq coverage744x (Rank: top 17%)
Annotation
HeliconiusHMEL0044704e-15952.13% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_E9G8Z12e-2127.69%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9G8Z1_DAPPU
NCBI RefSeqXP_002411603.12e-1631.38%conserved hypothetical protein [Ixodes scapularis]
NCBI nr blastpgi|3214732209e-2127.69%hypothetical protein DAPPUDRAFT_315174 [Daphnia pulex]
NCBI nr blastxgi|3214732209e-2227.78%hypothetical protein DAPPUDRAFT_315174 [Daphnia pulex]
Group
Gene OntologyGO:00055157.9e-08protein binding
KEGG pathway 
InterPro domain[130-169] IPR0021727.9e-08Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL34356 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200452-TA
ATGTTTGGTGTTATTAAATATTATATTTTTACGTACATATCAGTACTCGTATTACACGTAGAAAAATGTATATCAGAACCGCAGATGGCGTATCTGGCACTATATAGCCCTGGCGATACAGATTACACGGATGTACTGGTCCGTGGGGAGCATGATAACCTAACGCTGATCTGCGAGATGCGAGGGGATATCACGCCCAGGGTGTTCGTGTGGAACTACGTCGCCGATAGCAACACTACAAGTGCTGGCAGGTCGTTCACCCCAGAGCCCACAGAGGCTGTGGGCAGTCGTATAGAGAAGATTGATCTCCAGCTATCAGACAGTGGTCACTATATGTGTTCAGCGCCACCGTTCAGCGTCACCAAGTACATCCTGGTCCAGCGGAGGAGTCCCCAGTGCGCGCGGGGAGCGTTCTGGTGCGGTCATAGATGTGTGCTGCCCACATACGTGTGTGACGGGTGGTCGGACTGCGAGAACGCCGAGGACGAGGCCCCGCCGATGTGCGCGACGAATCCTTGCGCTGCTCCTGATAAGCTGAACTGCTCCCTCGGTCGCTGCATCTCGGAGGCGGCTTGTTGTTCGTGGCGAGGCGGGGAGAAAGCCCTGTGCCGTCAGCCGTCCTGCTGTGACGAACATCCGAGATACTCACAGGACGGTCTCCTGGAAGTGGAGTATCCTCCGCTGTACGAAGACAGACACTCCCCGGACGACTATGGTTTCATACAGTCGACCATATACACTGTGACTGCTTGCGCCTTGATCTTCATGATAGCGGTGGTTCTGCTGGTGTCCGCGCTGTGTAAGATGCACATGAAGAGGGCCGCGCTACGGGGATACGAGCACGCGCACAGAGACGCCCGCGGATACACTGCCCGTTTCCCTCCTCGCTACGAGGCCGCTCGTCTTATGGAGTCCAGCGTCACCGCCAGTCCGGTCCGGAGTCTCCACCTGGGTTCACCAACAGCTGGTCCCAGTAGCCCTCCAGGTCCCGCACCGTCCAGGGCTTTGGCGGCATTATCTGCTGCACTATGTTCGCGGTACCGACAGGTGCCGACTCAATGCTGTGAAGTGGAAATGAGGGACATAACGATGCCGCAATCATCCGTGACCTCGAGTCCACCAGAGCGACCCTTGACCCTTCAGCTCGGCCGGTTCCACTTCAACATACCAAGGTTCCGCAACGAGAGGCCCGACACTCCTGACATAACGGAAATCAATATAGAGGACCTAGAGTTCATAAGGATGCCGTCCAACGAGACCTACACCCTCAACGGCAGGACGATCAGGCTTTTCGGGGGCAATTTCCAAAACTACCCTCTAATAAACCGGCCGCCCCCCTACAATGAGGCCATGAGGCACAAATTCGGCCCACCGCCAGAATATTTAAGCCACGAGGTCCTCAACAACGACAGCGACGAAGACAACAGTAATATAGAAATGCCGCCGTGCTACGAAGACCTGGCGAGCGGGCTCGGCTCTAATTACCAACCCGCCAACGAAAATGACAGCAATCTGTTCGACACTAGAAGCCTCGTGACGGACCAGGAGACGGATTCTCATGTAACCTGCTTAGAGGAGTTACCGCCTAATATGAACAATAACATAGAGCACGCAGCCTCGTACATATCAGTTATAAACTCAAATACCGAGGACAACAACAACACGTGCGTCATAGACAACGCGGTCGACAATGTAGAGACAATAAGCACAGTCATAGACAACCTGCCAGCCATAGACGACATAAACGCCAACGATTCCATATACGGCGCATGTTGA

Protein sequence:

>DPOGS200452-PA
MFGVIKYYIFTYISVLVLHVEKCISEPQMAYLALYSPGDTDYTDVLVRGEHDNLTLICEMRGDITPRVFVWNYVADSNTTSAGRSFTPEPTEAVGSRIEKIDLQLSDSGHYMCSAPPFSVTKYILVQRRSPQCARGAFWCGHRCVLPTYVCDGWSDCENAEDEAPPMCATNPCAAPDKLNCSLGRCISEAACCSWRGGEKALCRQPSCCDEHPRYSQDGLLEVEYPPLYEDRHSPDDYGFIQSTIYTVTACALIFMIAVVLLVSALCKMHMKRAALRGYEHAHRDARGYTARFPPRYEAARLMESSVTASPVRSLHLGSPTAGPSSPPGPAPSRALAALSAALCSRYRQVPTQCCEVEMRDITMPQSSVTSSPPERPLTLQLGRFHFNIPRFRNERPDTPDITEINIEDLEFIRMPSNETYTLNGRTIRLFGGNFQNYPLINRPPPYNEAMRHKFGPPPEYLSHEVLNNDSDEDNSNIEMPPCYEDLASGLGSNYQPANENDSNLFDTRSLVTDQETDSHVTCLEELPPNMNNNIEHAASYISVINSNTEDNNNTCVIDNAVDNVETISTVIDNLPAIDDINANDSIYGAC-