Comments (5)
- Governmental_Agency finishes with
ValueError: probabilities are not non-negative
reported by Prosimos (this log failed during the previous benchmarking because the time ran out, this time I use another machine for benchmarking, not HPC)
Still running most of the other logs.
from simod.
Also, this time I'm facing a new issue with most of the runs. Before, I ran experiments via SLURM on HPC. Now, it's a simple powerful Linux machine in UT network, and I run experiments with a Python script executing a Simod's Docker container on a Simod configuration file via Python's subprocess.run
.
I'm having the following issue now:
"""
Traceback (most recent call last):
File "/usr/local/lib/python3.10/multiprocessing/pool.py", line 125, in worker
result = (True, func(*args, **kwds))
File "/usr/local/lib/python3.10/multiprocessing/pool.py", line 48, in mapstar
return list(map(*args))
File "/usr/src/Simod/src/simod/simulation/prosimos.py", line 252, in _evaluate_logs_using_metrics
value = compute_metric(metric, validation_log, validation_log_ids, simulated_log, simulated_log_ids)
File "/usr/src/Simod/src/simod/metrics/metrics.py", line 31, in compute_metric
result = get_dl(event_log_1, event_log_1_ids, event_log_2, event_log_2_ids)
File "/usr/src/Simod/src/simod/metrics/metrics.py", line 106, in get_dl
return evaluator.measure_distance()
File "/usr/src/Simod/src/simod/metrics/tsd_evaluator.py", line 64, in measure_distance
distance = self._evaluate_seq_distance(self.log_data, self.simulation_data)
File "/usr/src/Simod/src/simod/metrics/tsd_evaluator.py", line 86, in _evaluate_seq_distance
pool = Pool(processes=cpu_count)
File "/usr/local/lib/python3.10/multiprocessing/context.py", line 119, in Pool
return Pool(processes, initializer, initargs, maxtasksperchild,
File "/usr/local/lib/python3.10/multiprocessing/pool.py", line 212, in __init__
self._repopulate_pool()
File "/usr/local/lib/python3.10/multiprocessing/pool.py", line 303, in _repopulate_pool
return self._repopulate_pool_static(self._ctx, self.Process,
File "/usr/local/lib/python3.10/multiprocessing/pool.py", line 326, in _repopulate_pool_static
w.start()
File "/usr/local/lib/python3.10/multiprocessing/process.py", line 118, in start
assert not _current_process._config.get('daemon'), \
AssertionError: daemonic processes are not allowed to have children
"""
With this explanation https://stackoverflow.com/questions/51485212/multiprocessing-gives-assertionerror-daemonic-processes-are-not-allowed-to-have, it seems that because of the multithreaded evaluation which wants to run also multithreaded TSD (DL) metric computation it can't, because a thread cannot start another pool of threads.
There's a solution to this, https://stackoverflow.com/questions/6974695/python-process-pool-non-daemonic. But it still doesn't seem correct for a thread to run another pool of threads. It can actually slow down the metric computation because of a race for resources.
from simod.
Hi @david-chapela. I'm still running benchmarking, but I've just taken BPIC 2012 to peak into the results.
With timers, graphs look way off. However, average cycle time seems to be closer to the original times.
from simod.
These are the results after the bug is fixed (we've already reviewed them, so this is just for the history):
Production-timers_evaluation_20221222_160233267754.csv
Production-no-timers_evaluation_20221130_180242945366.csv
from simod.
We decided:
- To compare the results when optimizing for extraneous timers BEFORE the calendar optimization
- To use absolute EMD instead of cycle time EMD in the extraneous package as an optimization metric
from simod.
Related Issues (20)
- Add support for fuzzy calendars HOT 5
- Wait for Prosimos to be updated to parse distributions from pix-framework
- Update SplitMiner version to avoid "IllegalArgumentException: Comparison method violates its general contract" exception HOT 5
- Wrong version saved in configuration.yaml
- Investigate long running times when using pooled resources that occurred during benchmarking for Simod 3.5.24 HOT 3
- Unexpected max_cycle_time batching attribute name HOT 1
- Error when getting activities' names and IDs from BPMN model using BPMNReaderWriter and simod.bpm.graph.get_activities_ids_by_name
- Remove BPMNReaderWriter from code
- Repair missing activities in Resource Model when Process Model is provided HOT 2
- Parse correctly the name of the project with compressed CSV files
- How to start Simod HOT 1
- Only consider prioritization/batching if they improve the model performance
- Avoid final evaluation by default
- Analyze excessive print of warning when generating duration distributions HOT 1
- Add simple configuration examples
- Integrate new version of Split Miner without the dependency on X server
- Generate schema for YAML configuration
- Change CLI commands and options HOT 1
- BPIC_2017 running with fuzzy resource profiles and extraneous discovery method switched to naive fails HOT 8
- Simod produces malformed process model HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from simod.