Monarch geneset OGS2.0

DPOGS212378
TranscriptDPOGS212378-TA1233 bp
ProteinDPOGS212378-PA410 aa
Genomic positionDPSCF300019 + 336010-340461
RNAseq coverage334x (Rank: top 35%)
Annotation
HeliconiusHMEL0056651e-0633.72% 
BombyxBGIBMGA004648-TA2e-4234.86% 
DrosophilaCG6171-PB1e-1031.41% 
EBI UniRef50UniRef50_E3WU024e-2229.28%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WU02_ANODA
NCBI RefSeqXP_001849424.14e-1927.48%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|3228014494e-2530.30%hypothetical protein SINV_07193 [Solenopsis invicta]
NCBI nr blastxgi|3228014492e-2829.83%hypothetical protein SINV_07193 [Solenopsis invicta]
Group
Gene OntologyGO:00055151.7e-17protein binding
KEGG pathway 
InterPro domain[1-104] IPR0089841.7e-17SMAD/FHA domain
[284-307] IPR0194061.4e-11Zinc finger, C2H2, APLF-like
[12-100] IPR0002533.3e-06Forkhead-associated (FHA) domain
Orthology groupMCL19505 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212378-TA
ATGACTATAAAATTAGTTCGTACGGATACAATATCTCCTTGCAAAATAAATCTTCCCAAAGGTGAACATGTTTTTGGTAGAGGAAAACTACTTGATTGTAACGATAAAAGGATATCACGAGAACACGGTCAGATTATCGTTTCTGATGACTCATTGACAATTAAATCTCTACATCTAAATCCTTGCTTTTTTCAAAAGAAGCAATCCACACAAACTGAGATTCTTAAACTAAATAACACCACAGTTCTAAACAATGGTGACAGATTTGGCTTATTACCTGACTCATATTGGTTTGAAATACTTGTGTGTTATGATGAGAGTAAACATTGTACCGAAGCTAACTCTGAAAATGACACTGAAGAGTTGTGCTTGGAACAAGGTGGATGTGAAACAAGGAGTGAAGTAAAAAATGTACAGCCTAATATAAATCTGAGTGGTGACAATGAAGATACCAATGTTAGACCTGAATCACCGTCTCTATTAGCAAACACAGACAATGGAGTAGGTGCACCAAACAACTGTGTATCCCCGAGTGAGGGGAGTGGTTTAGCAGAGCAGTTGCACGGGTCAGATGATACACAGTTGGTGACACAGAAAGTGGAAGATTTCAAACAATCACCGAGTAAAAGGCCACACAGTCTGGACAATAGTGAAGCAAAGAAAATAAAAACTGAAGAAAATACTGAAGATGAACCAAATATGAAAACTGAACAAACAGTAAAAGATGAACCTGCAATCCCTGAGGATAATACGGAACCAGGTGTCAAACCAGGCTGCAGTACAGATGATACGCAACCAGCTCAGTGTGATGATAAACAAGGGCCAGTCAAACCTGCAAAGCCAAGGGAAAGATGCATGTTTGGTGCACAATGTTACAGGCGGAACCCAACCCACTTGGAGCAGTACAGTCACCCGCAGGATGCCGACTGGGGCGTAGGTGCGCGAGGAGTCTGTCCTTATGGGGCCGCCTGCAGGAGACGGAACCTCATGCACTGGAGCACCAACGACCATCCACCAGGGGTCCTGCCACCGCCACGACCAGGGAAACGAAGGCCGAAGGCACCTGACGAGGATGATGTGCCACAAGATCTGCCCAGCAAAAGGGTTCGGAAACCGGTTCCTAGACCTGACTGGGTCGGTTCAGACTCCGAGCCTGAAGATCCATACGGAACAGATGAATCTGACGAGTGGAAACCCGATAGTAATACCAATTATTCAGATGATTATATATAA

Protein sequence:

>DPOGS212378-PA
MTIKLVRTDTISPCKINLPKGEHVFGRGKLLDCNDKRISREHGQIIVSDDSLTIKSLHLNPCFFQKKQSTQTEILKLNNTTVLNNGDRFGLLPDSYWFEILVCYDESKHCTEANSENDTEELCLEQGGCETRSEVKNVQPNINLSGDNEDTNVRPESPSLLANTDNGVGAPNNCVSPSEGSGLAEQLHGSDDTQLVTQKVEDFKQSPSKRPHSLDNSEAKKIKTEENTEDEPNMKTEQTVKDEPAIPEDNTEPGVKPGCSTDDTQPAQCDDKQGPVKPAKPRERCMFGAQCYRRNPTHLEQYSHPQDADWGVGARGVCPYGAACRRRNLMHWSTNDHPPGVLPPPRPGKRRPKAPDEDDVPQDLPSKRVRKPVPRPDWVGSDSEPEDPYGTDESDEWKPDSNTNYSDDYI-