Monarch geneset OGS2.0

DPOGS210667
TranscriptDPOGS210667-TA1476 bp
ProteinDPOGS210667-PA491 aa
Genomic positionDPSCF300013 - 1707897-1785380
RNAseq coverage107x (Rank: top 60%)
Annotation
HeliconiusHMEL0221846e-13054.76% 
BombyxBGIBMGA006249-TA1e-9681.37% 
DrosophilaCG34402-PC3e-7034.00% 
EBI UniRef50UniRef50_D6X3G81e-10464.73%Putative uncharacterized protein n=4 Tax=Endopterygota RepID=D6X3G8_TRICA
NCBI RefSeqXP_002425273.13e-9256.29%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3504037152e-15057.67%PREDICTED: cubilin-like [Bombus impatiens]
NCBI nr blastxgi|3504037153e-15157.67%PREDICTED: cubilin-like [Bombus impatiens]
Group
KEGG pathway 
InterPro domain[39-161] IPR0008593.6e-25CUB
Orthology groupMCL10438 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210667-TA
ATGGCGCAGCCTCTGGAGGGCTTCCGCGAAAAGTTTCTGAATCCAGGTCATGATGTTTACATAAAATCTGAAGCAGGTCTGTTCGAAACAGAAGGCGTCCGGATACCCGGAACGGAATGCGACTACCAGTTTAGCCGGTCCGTCAATCGACCCACACACGGCCGGTTGTACAGCCCACGATACCCTTCCATTTACCCGAATAATGTTCGCTGCTCTTATCATTTCCACGCCAGGCCAAAGGACCGGGTGAAAGTTGTTTTCGAAGAAGTCTCCCTACAAAAGGGGGATATAAGTTGCCTGCGTCGGGCGGATATCATCAAAGTGTTCGATGGCAGGGACACAAACGCACCGGCCATTGCAATGCTATGCAACGAATTGACAGGTTATGAGGTCTTATCGACGGGCTCATACTTATTGTTACAATTCACTGCCAACTCCGTGTCCCCGGGACAAGGCTTCGCTGCCACTTTCCATTTCCAACCACCACCAGATTCAACAGCCGCAGATTCCGATCGTCTTCTGAAATTGAGTTTTGGAAAAGCCTTCGAGTCTCTGGGGCCAGAAGTTAGCGCAACAACTTCATCCTGTCACCAAGAGTTCAACAGTGACGGATCAAAACACGGGACATTAACTTCTCCTCATTATCCTTTAGCATATCCTCCTAATACTCATTGTCATTACGAGTTTTTTGGAAGAGGAAAAGAGAGAATAAGATTGATATTTCAAGATTTCTTTTTATTCAAATCTTCGGATGGGTCTGCAGATTGCCGGAACTTGGATTCGCTTCAAGCGTTCGTTAACGTTGATGGGCGGCTAGAAAATGTTGCCACATTTTGTGGGGTCGATCGGCCCCAACCAATAATGTCAAATGGCCCAAAATTGATGTTAGAATTTCGAGGTACCCAGTCTTCGAGACATTCAAGAGGATTTAAAATATCATATTCATTCATAGAGAATTTCGGTATTACAACTGGTAGACAGCTGAAGGAGTTCCCTTGTGCTTTCGTGTATAACAGTAGCGAGTCTCACAACGGTACCTTCGCTTCTCCCAACTCCCCTGGTCTATATCCGAGGGACACAGAATGTTCTTACTTCTTCCACGGAGGGCAGAGTGAGAAAGTTCACTTGCATTTCACCCATTTCGATGTAGAGGGAGTCCTACCTTGTGAAGCTGTGTCGGCAAGCGACTATGTAGAATTTTCAAACTACATGACAGAAGATAATAAGTATGGCAGGTATTGCGGTCAAATGAAAGAATTCCACGTGGAATCTGAGAGGAACTTCTTTAAAGTCACGTTCAGGTCAAACGATAGATTAGATGGAACTGGTTTCAAAGCTATGTACCAATTTTTGGAAGCATCAGAGGAATATCAGGCTCCGATATCTGATAAGGCTTCTGAATCATTGCATTTTTCTGGTGTCATGGGGCTCATAATTGCCGCAGCACATCTGAATCATCAACTATATCATTCATAA

Protein sequence:

>DPOGS210667-PA
MAQPLEGFREKFLNPGHDVYIKSEAGLFETEGVRIPGTECDYQFSRSVNRPTHGRLYSPRYPSIYPNNVRCSYHFHARPKDRVKVVFEEVSLQKGDISCLRRADIIKVFDGRDTNAPAIAMLCNELTGYEVLSTGSYLLLQFTANSVSPGQGFAATFHFQPPPDSTAADSDRLLKLSFGKAFESLGPEVSATTSSCHQEFNSDGSKHGTLTSPHYPLAYPPNTHCHYEFFGRGKERIRLIFQDFFLFKSSDGSADCRNLDSLQAFVNVDGRLENVATFCGVDRPQPIMSNGPKLMLEFRGTQSSRHSRGFKISYSFIENFGITTGRQLKEFPCAFVYNSSESHNGTFASPNSPGLYPRDTECSYFFHGGQSEKVHLHFTHFDVEGVLPCEAVSASDYVEFSNYMTEDNKYGRYCGQMKEFHVESERNFFKVTFRSNDRLDGTGFKAMYQFLEASEEYQAPISDKASESLHFSGVMGLIIAAAHLNHQLYHS-