Monarch geneset OGS2.0

DPOGS201120
TranscriptDPOGS201120-TA3060 bp
ProteinDPOGS201120-PA1019 aa
Genomic positionDPSCF300137 + 258473-284802
RNAseq coverage657x (Rank: top 19%)
Annotation
HeliconiusHMEL0179780.085.58% 
BombyxBGIBMGA013670-TA3e-4566.15% 
Drosophilapx-PA3e-12448.43% 
EBI UniRef50UniRef50_B0WIZ23e-13755.01%Putative uncharacterized protein n=3 Tax=Culicidae RepID=B0WIZ2_CULQU
NCBI RefSeqXP_393156.37e-15855.05%PREDICTED: similar to plexus CG4444-PA [Apis mellifera]
NCBI nr blastpgi|3287900421e-15655.05%PREDICTED: hypothetical protein LOC409658 [Apis mellifera]
NCBI nr blastxgi|3838614420.042.53%PREDICTED: uncharacterized protein LOC100876997 [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL14841 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201120-TA
GACGAACCGAAGGCGTATATCAAGAGCTCCCTGAGCGGCGGAGGGGCGGCGGCGTCCCCCGCGCCCCGCGACCCGCCCGCCGAGGAGAGGGTCTGCTTCGTGTGCGGGGGCGCTGGCTTCGGAGACCAGTACACCATCAGGGTGAAGCCTGACCCGCAGCAAGCCTCGGAGCCGTACTTCCCGTTCCTGTGCGGCCATTCGGCGCCGCTGGGCTACAGGCCGGAGGGCGACCACGAGGGCTCGGTGAGGGCGTGCTGCGTGTGCTACACCTTCCTCAGACAGCAGTGGGAGCAGTACGACAGGGAGAACAAGCCGCACAGCCAACGGTTCTACTGGATGAAGAGGCTGGACGGGAAGCCGTTCATAGGAGCGGATATGTCATTCCAAGGGGAGTACGCCGCGCAAGTCTTGGGCCTGGGGGCGGAGCGCGAGGACTCCAGGCCGAACGAAACAGCTCGGGTTACCGCAGATACTAAGGATAACAGCAGCGATTCCCCCCCACCAGCGGCGCGGGTGGTGGTGGTGGGGGTGGTGGGGGTGGCGGTGCGGTGGTGGTGGTGGGGCGGGGGCGAGGAGGAAGTGCTGGACCTCCGCTGCGAGCCGTCCGTGATCTCGGTGGGGTCGCAGGCGTCGGGCGCCAGCGAGCCGGCCGGCTTCCTCGAGGCGGTCCACAGCACGGCGTCCGTTTACTCGGCCGCCTCCACCTCCTCCTCGGAGCGGGACATCCTCGACCTGTCCATGCCCGACAAGAACTGTATGACGGAGGTGTGCTACGTGTGCGGAGACGAGTACAGGAGAGGCTCGCTCGCCAACCTGCTCACCAACGAGCCCAAGGATAAGGCGAGCAAGCAGGCGTACTTCCCCATCTTCGGCGAGCAGCACCCCCGGCCGGCGCGCTCCCGGCCCAAGGACGCGGCGGGCACGGTCCGCGCCTGCGCCGCCTGCCACCACCACCTGCTGCAGCAGTGGAACGTGTACCAGCTCTGCGCCGTATCGCAGTCAGTGCGTCCTGTCCGTTCCGCGGAGCCGAACGCGTCGCTCCGGCGATACACTCATAAACAATTGTGGGCTTCACTAATACCCGAGTGCTGTGTACAGCGAGGGCCCACGGCCCACTCGCAGGGCTCCGCTGGGGTCGCGGTTGTGTGTGACGGTACTTATGTGTCGGACTGTGTTCACTGCAACATGTCGCCAGCCACGGAGAAGATGCGCGGCGTGCCGGCCCACGAGCGCGTGTACACGCTTCGCACGCGGCCGCCGTCCCTGCTGCCGCCGTCCCCGCTGAGCCTGGCATCTTCGGGGGACAAGTCGGTGGTGACGCACGCCCACTCCCCGTCCCTGGAGCGGCCGCCGTCCCAGGCCACGGCGAGCTTCGTGTGCTACGTGTGCGGGGTGTCCGCACCCAGCAGTCAGCTGCGGCTGGTGTACTGCTGCCCCAATCCCGAGCGCGAGCCCTACTACCCTTTCATCACCACCCTCAAGCCTCACCCGGACGCCAGCCCCATAAGTCCTCAAGGCATGGTGCAGATCTGTGCAGCGTGCTACAAGAGTATCCCGCACAAGTATCCCGTGTATGGAGACGAGCCCAGGGACCACACAGTCAACCACACAGCGAACCACGCCGCGCACATACGTTTCAAACCATACGAACTGAAGGCCGGTGCCGTCGCCAAGCGTCCGTCCAGCGCGCTCGCCACCAGCCCTCACAACCAGCTCGCTGAGAACGGAATGGGGCTCTACAGATGTCACGTGTGCGCGGGCCTGTTCCCGCAGCAGTCCATGGAGTGGTTGTCGACGTCCGCGGAGTACATGAACTCGCACGCGATGCACTTCCCGTGCCTGGCGGGCGGCGGGTCTCGCGGCGGGCGGGTGCTGGCGTGCAGCCGCTGCGTGCGACACCTGGCGCGCCAGTGGGAGCTCATGGACGCCGAGCGCGTGCCGCTGGAACACCGCCGGTACAACATCCCCTCCCCGCTGCCCGCCAACTCCGGAATGAGCGGGGACCGGGTCATCCCCACCCCGCCCTCCACCACCTCGGACAGGACGGTCGCCAGCAACACCGCGTGCACGTCGATCTACTGTTTCCTGTGCGGGCTGCACTCCGACCTCACCCTGGCCCGCGTACTGTTCGGTCAGCCTCAGGGAAACGCTCCCTACTTCCCCTGCCTCCTCACACACCAGAGTCACCCTAACGCGGAGCAGCTCCGGGAGAACGGCAGCGCCCTGGTGTGCACCTTCTGCTATCACTCGCTGCTGGCGCAGTGGAGGCGGTACGACGCGCTGGGGGGAGTCGCGCCCGAGAGGAGGGCCTACAACGCCCACGACTACCACTGCCATCTGTGCGGCATCAAGACCTACAGGAAACGAGTGCGCGCGCTGCCCATCAAGGAGTTCCCGTTCCTGATGCAGCGGCGGACGGAGAACTCGCTGCTGCTTGAGAACGGAGACTACGCGGTCGTGTGTCTGGACTGCTACGAGAGTCTACGGACGCAGGCCGCGGACTACGAGCGGCGCGGCGTGCCCGTGGACAAGAGGGAGTACAACTGGCTGCAGCAACCGCCGCCGCCCGAAGACAGCGCCGACGTCACCATCGCCCGGCTACCCTCGGGAGACCGCTCCGACAAGCTGATACCGCAGTCGCTGGTGGTCGGCCGCGGGAAGAGACACAGTCCCAAGCACGCGCCCGCTGACAGGAGGCACAAGAACGACAAGAGTGACGCGGGTGAGTACACACACAGAGACACAGACACAAGGTTCCTGCAGTGCCGTGCGTGTGCTGTATCGATAGCAGATAGTGATGTGCCTGTGAGTGTAGCGAGAGATGGATTACCGACCCCCCCACCCGATGATGAGCGTCGGGGCGCCGTACTACCCCCACACAGGCGGCCCGTCTGGTCGCCGGTCGCCCCTCTTTGCGCGCGCACCGTGCCCGGTGGCACAGGACACGAGCGACCGGCGTGGTCGTTCGCCCTCACCGCCTCCGGTGCTGCCGGAGGGGTGTCACAGGGTGTTGTCTGTAACTCGATCCGGTCGGAAATGTCCGTCAGATTTGTCGCTTGA

Protein sequence:

>DPOGS201120-PA
DEPKAYIKSSLSGGGAAASPAPRDPPAEERVCFVCGGAGFGDQYTIRVKPDPQQASEPYFPFLCGHSAPLGYRPEGDHEGSVRACCVCYTFLRQQWEQYDRENKPHSQRFYWMKRLDGKPFIGADMSFQGEYAAQVLGLGAEREDSRPNETARVTADTKDNSSDSPPPAARVVVVGVVGVAVRWWWWGGGEEEVLDLRCEPSVISVGSQASGASEPAGFLEAVHSTASVYSAASTSSSERDILDLSMPDKNCMTEVCYVCGDEYRRGSLANLLTNEPKDKASKQAYFPIFGEQHPRPARSRPKDAAGTVRACAACHHHLLQQWNVYQLCAVSQSVRPVRSAEPNASLRRYTHKQLWASLIPECCVQRGPTAHSQGSAGVAVVCDGTYVSDCVHCNMSPATEKMRGVPAHERVYTLRTRPPSLLPPSPLSLASSGDKSVVTHAHSPSLERPPSQATASFVCYVCGVSAPSSQLRLVYCCPNPEREPYYPFITTLKPHPDASPISPQGMVQICAACYKSIPHKYPVYGDEPRDHTVNHTANHAAHIRFKPYELKAGAVAKRPSSALATSPHNQLAENGMGLYRCHVCAGLFPQQSMEWLSTSAEYMNSHAMHFPCLAGGGSRGGRVLACSRCVRHLARQWELMDAERVPLEHRRYNIPSPLPANSGMSGDRVIPTPPSTTSDRTVASNTACTSIYCFLCGLHSDLTLARVLFGQPQGNAPYFPCLLTHQSHPNAEQLRENGSALVCTFCYHSLLAQWRRYDALGGVAPERRAYNAHDYHCHLCGIKTYRKRVRALPIKEFPFLMQRRTENSLLLENGDYAVVCLDCYESLRTQAADYERRGVPVDKREYNWLQQPPPPEDSADVTIARLPSGDRSDKLIPQSLVVGRGKRHSPKHAPADRRHKNDKSDAGEYTHRDTDTRFLQCRACAVSIADSDVPVSVARDGLPTPPPDDERRGAVLPPHRRPVWSPVAPLCARTVPGGTGHERPAWSFALTASGAAGGVSQGVVCNSIRSEMSVRFVA-