Monarch geneset OGS2.0

DPOGS203291
TranscriptDPOGS203291-TA1398 bp
ProteinDPOGS203291-PA465 aa
Genomic positionDPSCF300003 - 1527967-1534262
RNAseq coverage780x (Rank: top 17%)
Annotation
HeliconiusHMEL0063926e-6957.25% 
BombyxBGIBMGA012239-TA5e-5847.02% 
Drosophila% 
EBI UniRef50%
NCBI RefSeqXP_972068.22e-2133.16%PREDICTED: similar to AGAP003656-PA [Tribolium castaneum]
NCBI nr blastp%
NCBI nr blastxgi|1892372556e-2926.19%PREDICTED: similar to AGAP003656-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055152e-10protein binding
KEGG pathway 
InterPro domain[200-246] IPR0021722e-10Low-density lipoprotein (LDL) receptor class A repeat
[89-145] IPR0000822.7e-07SEA
Orthology groupMCL20549 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203291-TA
ATGTTCTCTAATCTTAGTGAACAAATTAATTTTATATTTGTGGTGTCATCGAGTAAGATTTTTTGGGACGAGGAAGACGATGTTTCGAATGAGTTCTTAGAAGTGGGCAGCTTCTTGGGAAGGTTTAAACGACAATTAACGGCGACACCACCACCTGAGAACAACGAGAGTGTCACGGCCGATGATGAAGAGTCGATTGATAATGACAATGATTTCGACGAAGGTTCCGGACTCTCGGGTGGCCGGTTGGAGACAGAATTCAAAGAAAAAACCCTTCACGTGTCCTTCGTGGTTAACCAGCCATATCAAACAGAATACTCCAACCGAGACTCAGTGGAATTTCAGAATTTCTCACAGTCTCTCGCTGAAGCTGTTAATGCTGTCTTCCAAAATCTCCCTGGAACCCACAGAGCTAGTCTTGTCAGAATCCAATCCCGGCCAACAGACGAGTTCACATGCAAGGTCACTTTAGAGATAGTTACTATGGGTTACGAAACTGATAGAATTTCCGAAATACTTCGCGACTACATAAGAAAGAAGAGGACGCTTGGAAATGTTGCCGTGGACGATGTGGGTTTCAGTTCCACCGTCATTGATCCAGGAAGTAATACTCCATTGGACATCTGCACTATTGATGAGATAAGATGCTACGATGGTCATTGTGTGCCAGAGACGGCAAGATGTGATGGAAAAAATGACTGTTCCGATGGCTATGACGAACTTAACTGTGCTGATTTGGATAACGAAGATGATCAAGGAGAAGATCAGGACCAAGATTTGGATCAAGATCAGGATCAAGACCAAGACCAAGATCAAGATCAAGATCAAGACCAAGATCAAGACCAAGACCAAGACCAAGACCAAGACCAAGACTTAGACCAAGACCAAGACCAAAACCAAGACCAAGATCAAAATCAAGATCAAGATCTAGATCAAGAACAAAATCAAAATCAAAATCAGGATCAAAATCAAAATCAGGATCAAAATCAAAATCAGGACCAAAACCAGAACCAAGATAACGATCAAAATCAGGGTCCGAATCCAAACCAATCTCAAAATGTAGACAGAGACCCATACCAAACTTCTGAACAATATATTCCTGTAGATAGCGGCAGAGAATCAAATTATGATAATAATGGTAATAATGAAAATTCCGTCGATAACAATGTCAACGGTGTTGATTCTGACGTAGCTACATCGGATCCATTGGATTTTGGACCATCGGAAGATAATCATGAAAACGCCGGAGACCCTAACGTCAATTCCTGGACCGAAAGACCAGGATGTCAAGGCAATGATTTAAGCTGTGATGAAACTCGGTGCATCTCAACTGACATGCGATGTGACGGAAAGCAAGACTGCGACGATGGCACCGATGAAGCTGATTGCCGTAAGTAG

Protein sequence:

>DPOGS203291-PA
MFSNLSEQINFIFVVSSSKIFWDEEDDVSNEFLEVGSFLGRFKRQLTATPPPENNESVTADDEESIDNDNDFDEGSGLSGGRLETEFKEKTLHVSFVVNQPYQTEYSNRDSVEFQNFSQSLAEAVNAVFQNLPGTHRASLVRIQSRPTDEFTCKVTLEIVTMGYETDRISEILRDYIRKKRTLGNVAVDDVGFSSTVIDPGSNTPLDICTIDEIRCYDGHCVPETARCDGKNDCSDGYDELNCADLDNEDDQGEDQDQDLDQDQDQDQDQDQDQDQDQDQDQDQDQDQDQDLDQDQDQNQDQDQNQDQDLDQEQNQNQNQDQNQNQDQNQNQDQNQNQDNDQNQGPNPNQSQNVDRDPYQTSEQYIPVDSGRESNYDNNGNNENSVDNNVNGVDSDVATSDPLDFGPSEDNHENAGDPNVNSWTERPGCQGNDLSCDETRCISTDMRCDGKQDCDDGTDEADCRK-