Monarch geneset OGS2.0

DPOGS210671
TranscriptDPOGS210671-TA1956 bp
ProteinDPOGS210671-PA651 aa
Genomic positionDPSCF300013 - 1518805-1551296
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0222052e-11162.46% 
BombyxBGIBMGA006252-TA6e-8568.44% 
DrosophilaCG34402-PC1e-6929.34% 
EBI UniRef50UniRef50_D6X3G89e-9055.23%Putative uncharacterized protein n=4 Tax=Endopterygota RepID=D6X3G8_TRICA
NCBI RefSeqXP_002425273.13e-8254.72%conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastpgi|3287836961e-17351.78%PREDICTED: cubilin-like [Apis mellifera]
NCBI nr blastxgi|3287836965e-17551.33%PREDICTED: cubilin-like [Apis mellifera]
Group
KEGG pathway 
InterPro domain[199-321] IPR0008591.2e-26CUB
Orthology groupMCL10438 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210671-TA
ATGGCTATGTCTAAATTATATCGTGCTGGGAACTGTAAATACAACATGCAACCGGGAAACTTACACACAATGAAGACATCAACAGCCCTGATGCTGTTAGTCCACTTATCGCTTCACACGGTTGATGAGGGCCACGCAATAAATCCGAGCTGCACCTGCGTACATTTTACGTCCACTCACGGCAAAGAAAGGGGAACTTTCAGCAGTCCAGATTACCCTCGTCCTTACCCTCAGAACGCATGTTTACTTTACACGTTTCTTGCTGAAGCTCACCAGATTGTTGAACTTGTCTTCACGGACTTTGACATTTACAAGGAACACATGGATTGCAGCAATGGAAACTATCTAAAAGTATATTCGGAGGTTGAAATACATGGTCCCGGCCCTCCCGGTATTAACGAATTTTCAGTCTGGTCCAGGATTCTATGTGGAAATCGTGCTGAAGCACCACCGCCGCTATATTCTCATGGACCAATAATGATATTAGAATTTCAAAGTGGAGAGAAACCTTCCAATGCTTCTGGCTTTATTGGAACATACAGGTTTATCGATCGACGTAACTTTGAGACGGATGGTGTTAAAGTATCGGGGACACAGTGTGATTATGTATTTGCATCGCAAGCAGAGCGTCCTAGTCATGGACGACTATATAGTCCAAGATATCCTTCCAGTTATCCTAATAGCGTTAGGTGCAATTATCATTTTAATGCAAGAAAAAATGAAAGAATAAAATTAGTTTTTGAAGAATTATATCTACAGAAAGGCGATGTAAGTTGTCTTAACCGCGGTGATGTGATCAAAGTTTTTGATGGGAGAAATTCAGTAGCACCAGTCATTTCAATGCTTTGTAATGAAATTGTAGGATATGAAATCCTTTCAACGGGACCAGAGCTTTTAGTGCAGTTCTCATCAAATTCCAAAACACCCGGACAAGGATTTAAAGCAAGTTACCAGTTTCTAGCGAAGGACGCCTCTAGTGCAGAAACTGAAGGTAACAAAAAGCCAAGTGCAATGGATGGTTATTCTTCTGTTGGACCAGCTGTTAGCGCAACCACGTCATCCTGTCATCAAGTATTCAGATCTGATAAGAGCAGAAGCGGCAAATTGATCTCGCCGTTATACCCTTCGCCGTACCCTCAAAAGACGCAATGTCATTATGACTTTCTCGCGAAAGGGCGGGAACGCGTGCGATTAGTCTTTGAAGACTTTAATCTACAACGAGCCAGCAGTATTAGCGACTGCGAGAGTATGGACTCATTTGACGTCTTTTTGTATGTGGACGGTCGACTCGAGAAAATGGCGTCATATTGTGGCAATGACGTACCGAAGCCAATAATGTCGAATGGTCCAAAGTTGTCCATCGAGTTTAGGGGTATATACTCGTCAAGATACAGCAGAGGTTTTAAGATAGCGTACTATTTTGTTGAAGATTATGCAATCGCTACGGGAAAGCAGCTTTTAGAGTATCCATGTGCCTTCGTATATAATATCACGGATCGACGAAAAGGGGTTATGACGTCACCAAATTATCCGGGCCTCTACCCTAGGGACACTGAGTGCAATTACTTCTTCCACGCTCGAAAGAACGAGAGAGTGCATCTTAAGTTCTCACACTTTGACGTTGAGGGAGTTGTACCATGCGAAGCTGTCTCGGCGAGTGACTACGTGCAATTCTCTAGTCAAATGATAGATATAGATAGTCAAAGATACTGTGGTCAATTGAGAGAGCTGGATGTTGTATCAAAGAGTAATTTCCTGAGGGTCACCTTCCGTTCCAACGACAGACTGGATGGAACTGGTTTCAAAGCTGAATATATTTTTCTGAAGGACTCTGAAATGCGCAGTGTCAAATCTGAAACAAATGGTTCTGTTGGACTTCACATTAAAAAAGATCATCTATGGAGGAACTTCTTTAGACTGTTGTTAGCTTTGAGCATAGTTACAATAGTATTATAG

Protein sequence:

>DPOGS210671-PA
MAMSKLYRAGNCKYNMQPGNLHTMKTSTALMLLVHLSLHTVDEGHAINPSCTCVHFTSTHGKERGTFSSPDYPRPYPQNACLLYTFLAEAHQIVELVFTDFDIYKEHMDCSNGNYLKVYSEVEIHGPGPPGINEFSVWSRILCGNRAEAPPPLYSHGPIMILEFQSGEKPSNASGFIGTYRFIDRRNFETDGVKVSGTQCDYVFASQAERPSHGRLYSPRYPSSYPNSVRCNYHFNARKNERIKLVFEELYLQKGDVSCLNRGDVIKVFDGRNSVAPVISMLCNEIVGYEILSTGPELLVQFSSNSKTPGQGFKASYQFLAKDASSAETEGNKKPSAMDGYSSVGPAVSATTSSCHQVFRSDKSRSGKLISPLYPSPYPQKTQCHYDFLAKGRERVRLVFEDFNLQRASSISDCESMDSFDVFLYVDGRLEKMASYCGNDVPKPIMSNGPKLSIEFRGIYSSRYSRGFKIAYYFVEDYAIATGKQLLEYPCAFVYNITDRRKGVMTSPNYPGLYPRDTECNYFFHARKNERVHLKFSHFDVEGVVPCEAVSASDYVQFSSQMIDIDSQRYCGQLRELDVVSKSNFLRVTFRSNDRLDGTGFKAEYIFLKDSEMRSVKSETNGSVGLHIKKDHLWRNFFRLLLALSIVTIVL-