Monarch geneset OGS2.0

DPOGS208770
TranscriptDPOGS208770-TA1428 bp
ProteinDPOGS208770-PA475 aa
Genomic positionDPSCF300036 - 904621-909936
RNAseq coverage760x (Rank: top 17%)
Annotation
HeliconiusHMEL0154140.071.28% 
BombyxBGIBMGA007631-TA0.074.84% 
DrosophilaCG2875-PA4e-6133.26% 
EBI UniRef50UniRef50_UPI00021A6C575e-10345.11%UPI00021A6C57 related cluster n=5 Tax=unknown RepID=UPI00021A6C57
NCBI RefSeqXP_394878.27e-10445.05%PREDICTED: similar to CG2875-PB, isoform B [Apis mellifera]
NCBI nr blastpgi|3227858392e-10344.17%hypothetical protein SINV_07970 [Solenopsis invicta]
NCBI nr blastxgi|3838485936e-10345.32%PREDICTED: nucleolar complex protein 4 homolog [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[266-417] IPR0056123.6e-33CCAAT-binding factor
Orthology groupMCL12089 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208770-TA
ATGTTTGAGGCTGAAACGGAAAATTACACCCCGTTGCTTTTAACAATTGAAGTGATCTTTACGGAGCTATTAAAACGAGGTGATTTAGTACAACATATTGAACCCTTAAAACCAATAGACCGTAGTCCTGAGGCAGAATATACAAGATGGCTCAACGAGTGTTATGAGACCGCTCTCTCCCGTGTATTGGAATGTATTCGACGCGGACGCACCAGCTCTCGCCTTCAGGCCCTAGTCACATCTTGCAAATTGATGCAAGCTGAGGGAAAATATCCTTTAGAACACACCAGTGGTTACTTCTTCCCCTCTGTTAGATTGAAGAATATATTTTTGGTACTCCTGGATTCTGAAATTTCCATGTCAGCACCAATAGCTCGCTTCCAGGAGTTCACAGAGTACAGAGACGTGCAGCAACATGGCCTTAAAGTACTGTCGACACTGGCTTGTCATAAATCTCCATCTCAGACGTACATGCAAAATTATCTAGAGTTGTTCGACAAGTTGTTGGCATCGGAAATACCAGCAGAAGTGAGGAAGACAAAAGATAAGATCGGTGAAGAAGATTTCAAAGTACTGTGCGCTAATGAAGGCAAGCCGTCTTTCCCATACAACACATCCGTGTGTCGTCGTTATGCCAACCGTTGCTGGGGTTTCTCCTGTCAGTGGCCCTTATGTGAGTCCCCTCGGTCTCATCGCCGCGCCCTAGTGCTTCTCGTTGAGAAACTGATGCCGCTACTGAACAAACCTCACCTGGCCACCGACATGCTCTGTGACAGCCTGGACGCGGGTGGTCCTATCAGCATGTTGGCTCTGCAGGGCATGCTGGAGCTGGTCCGTCACCACAACATCGACTACCCGGACATGTACGACCGCCTGTACGCCATGTTCGAGCCGGAGATGTTCGCCACCAGATACAAGAAACGCCTCATCCACCTCGCCGATATATTCCTGAGTTCCACTCACCTGCCCGAGAGTCTGGTGGCAGCCTTCGCTAAGCGTCTGTCTCGCCTGGCGTTGGTGGCGTCTCCCGAAGACGCCATGGGACTGTTGCAACTGGTGGGGAACCTTCTACTGAGACATACTGCACTGAAACGAATGATTTGTTGCGAGGACACGCCCGCTGTCATGTCTAACGACCCCTACGTGATGGAGGAGTCTTCTGCGTCGCGGTCCAGAGCCCTGGGTTCGTCTTTATGGGAGGTGCGAGCCTTGACGCGGCACTGGCAGCCCACGCTGGCCACCGTCGCCAGACAGGTCACTGACCCTGACAGGCGAGCCCCCATCGACATCGATCATGCTGGAGAAGAGATGTTCGATGCGGAACTAAAGAAGAGGTTCAAGACGATAGAAGTGAACTTCATACGTCCTCAGAGTATGTCGCTGCCGTCCGGGGAGAGACTCGCGCAGTACTGGGAGATAATGGCCTGA

Protein sequence:

>DPOGS208770-PA
MFEAETENYTPLLLTIEVIFTELLKRGDLVQHIEPLKPIDRSPEAEYTRWLNECYETALSRVLECIRRGRTSSRLQALVTSCKLMQAEGKYPLEHTSGYFFPSVRLKNIFLVLLDSEISMSAPIARFQEFTEYRDVQQHGLKVLSTLACHKSPSQTYMQNYLELFDKLLASEIPAEVRKTKDKIGEEDFKVLCANEGKPSFPYNTSVCRRYANRCWGFSCQWPLCESPRSHRRALVLLVEKLMPLLNKPHLATDMLCDSLDAGGPISMLALQGMLELVRHHNIDYPDMYDRLYAMFEPEMFATRYKKRLIHLADIFLSSTHLPESLVAAFAKRLSRLALVASPEDAMGLLQLVGNLLLRHTALKRMICCEDTPAVMSNDPYVMEESSASRSRALGSSLWEVRALTRHWQPTLATVARQVTDPDRRAPIDIDHAGEEMFDAELKKRFKTIEVNFIRPQSMSLPSGERLAQYWEIMA-