Monarch geneset OGS2.0

DPOGS200972
TranscriptDPOGS200972-TA1236 bp
ProteinDPOGS200972-PA411 aa
Genomic positionDPSCF300431 + 113467-118978
RNAseq coverage73x (Rank: top 66%)
Annotation
HeliconiusHMEL0161938e-8352.67% 
BombyxBGIBMGA011047-TA2e-11151.83% 
Drosophilamspo-PA5e-3636.65% 
EBI UniRef50UniRef50_E9GZI64e-6135.21%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9GZI6_DAPPU
NCBI RefSeqXP_972867.12e-6432.68%PREDICTED: similar to GA10108-PA [Tribolium castaneum]
NCBI nr blastpgi|910781043e-6332.68%PREDICTED: similar to GA10108-PA [Tribolium castaneum]
NCBI nr blastxgi|910781043e-6533.82%PREDICTED: similar to GA10108-PA [Tribolium castaneum]
Group
KEGG pathwaycin:1001849274e-06 
 K03995 (C6)maps-> Systemic lupus erythematosus
    Complement and coagulation cascades
    Prion diseases
InterPro domain[348-405] IPR0008846.5e-14Thrombospondin, type 1 repeat
Orthology groupMCL25806 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200972-TA
ATGGTCGCCATTTTGTTTTTATTTGTCACGTGGTCAACTTATTGTACAGGATGTGATCTTGACACGGCTGTTTATAAAGTTACAGTGAAATTTTTATGGTGTGAGACAAATTTTCCTAAAGATTATCCTTTAAACCAACCCAAAGCACAGTGGTCTCCTTTATTTGGACAATCTCACAATTCTTCGTACCATCTTTATCGTGTGGGTGAAGTCGCGAGTGAGGCGATGCGAAATTTCGCTCTTTTCGGTAAGCCGGAGGAATTGCTGAATCAAGGTGATGGGGATGAAAGGGTCTATGATCAGTTTCTTGCCCCAGCTGTTGGTGATGGCACAGGCGAAACTGGAAATATAGTATTTTTGGATGGAGAGTATAGCTTGATATCAGTGGCGTGCCGTCTTATACCGTCTCCGGATTGGTTTATTGGCGTTGACAGCTTGGAACTGTGCGTGGATTCGTCTTGGGTGGATCAAATCACACTTGATCTTGAGCCTCTAGATGTCGGAGCGGCCAGCGGTTTATCGTTCACGGCACCTCACTGGGAATCTTTTCAACCAGTCAGCAAACATAGACCACAACAACCTAGTCACCCATCTGCCGCGTTCTATTATCCAGAGTTACGTGAACTTCCCCCTATTGCCAAGATTGAGTTTTTGAAGATAAAGGAGTATTCAATTAGAGAACAAAACGACATGATCCGAGAGGAATTATTCAATAATATAAAGATGAAAGAAAAGTCTATGAATTCACCCAGAAGAAAAGTAAATCTGGTAGAAAATTATGTTAAATATGATACTACTAGAAACTATGCAGATCTAAAAAATTTGGAAGAAGATGAAGTTGCACGTCCAGGCTATAAGCAGGATGACAGCAATGTGCTTGTTGTAACAAAATCACCTTTAATGGAAAATTCAAAGGAATATGGCGCTGGCTTAAGCAGCAATGACGATGTGGTGCTGGCGGTAGCAAATGGTAGGCGATTTGGATTAGGACGGCATCTTCCTCGACACTTCAGATCTCGTCTCCACCATGCAGTTAATAAATTACAGCCTCAAGATTGCCTCGTATCTGATTGGGGGAGCTGGTCATCATGTTCAGTAACCTGTGGTGTGGGTGATCAATATAGAAGCAGATATGTAATCAGGCAGAATACGAGGGACGGCAGAGATTGTCCACCCTTAGCTGATGTCAGACGCTGCAAAAACTTCAACTCTTGTACCCGCGGGGACGGCTACTAA

Protein sequence:

>DPOGS200972-PA
MVAILFLFVTWSTYCTGCDLDTAVYKVTVKFLWCETNFPKDYPLNQPKAQWSPLFGQSHNSSYHLYRVGEVASEAMRNFALFGKPEELLNQGDGDERVYDQFLAPAVGDGTGETGNIVFLDGEYSLISVACRLIPSPDWFIGVDSLELCVDSSWVDQITLDLEPLDVGAASGLSFTAPHWESFQPVSKHRPQQPSHPSAAFYYPELRELPPIAKIEFLKIKEYSIREQNDMIREELFNNIKMKEKSMNSPRRKVNLVENYVKYDTTRNYADLKNLEEDEVARPGYKQDDSNVLVVTKSPLMENSKEYGAGLSSNDDVVLAVANGRRFGLGRHLPRHFRSRLHHAVNKLQPQDCLVSDWGSWSSCSVTCGVGDQYRSRYVIRQNTRDGRDCPPLADVRRCKNFNSCTRGDGY-