src.utils package¶

Subpackages¶

Submodules¶

src.utils.django_orm module¶

src.utils.django_orm.duplicate_orm_row(obj)¶

src.utils.event_attributes module¶

src.utils.event_attributes.get_additional_columns(log)¶

src.utils.event_attributes.get_event_attributes(log)¶

Get log event attributes that are not name or time

As log file is a list, it has no global event attributes. Getting from first event of first trace. This may be bad.

src.utils.event_attributes.get_global_event_attributes(log)¶: Get log event attributes that are not name or time

src.utils.event_attributes.get_global_trace_attributes(log)¶

src.utils.event_attributes.unique_events(log)¶

List of unique events using event concept:name

Adds all events into a list and removes duplicates while keeping order.

src.utils.event_attributes.unique_events2(training_log, test_log)¶

Combines unique events from two logs into one list.

Renamed to 2 because Python doesn’t allow functions with same names. Python is objectively the worst language.

src.utils.file_service module¶

src.utils.file_service.create_unique_name(name)¶

Return type:	`str`

src.utils.file_service.get_log(log)¶

Read in event log from disk

Uses xes_importer to parse log.

Return type:	`EventLog`

src.utils.file_service.save_result(results, job, start_time)¶

src.utils.log_metrics module¶

src.utils.log_metrics.avg_events_in_log(log)¶

Returns the average number of events in any trace

:return 3

Return type:	`int`

src.utils.log_metrics.event_executions(log)¶

Creates dict of event execution count

Return {‘Event A’:
	7, ‘2011-01-06’: 8}
Return type:	`OrderedDict`

src.utils.log_metrics.events_by_date(log)¶

Creates dict of events by date ordered by date

Return {‘2010-12-30’:
	7, ‘2011-01-06’: 8}
Return type:	OrderedDict

src.utils.log_metrics.events_in_trace(log)¶

Creates dict of number of events in trace

Return {‘4’:	11, ‘3’: 8}
Return type:	`OrderedDict`

src.utils.log_metrics.max_events_in_log(log)¶

Returns the maximum number of events in any trace

:return 3

Return type:	`int`

src.utils.log_metrics.new_trace_start(log)¶

Creates dict of new traces by date

Return {‘2010-12-30’:
	1, ‘2011-01-06’: 2}
Return type:	`OrderedDict`

src.utils.log_metrics.resources_by_date(log)¶

Creates dict of used unique resources ordered by date

Resource and timestamp delimited by &&. If this is in resources name, bad stuff will happen. Returns a dict with a date and the number of unique resources used on that day. :return {‘2010-12-30’: 7, ‘2011-01-06’: 8}

Return type:	`OrderedDict`

src.utils.log_metrics.std_var_events_in_log(log)¶

Returns the standard variation of the average number of events in any trace

:return 3

Return type:	`int`

src.utils.log_metrics.trace_attributes(log)¶

Creates an array of dicts that describe trace attributes. Only looks at first trace. Filters out concept:name.

Return [{name:	‘name’, type: ‘string’, example: 34}]
Return type:	`list`

src.utils.result_metrics module¶

src.utils.result_metrics.calculate_auc(actual, scores, auc)¶

Return type:	`float`

src.utils.result_metrics.calculate_nlevenshtein(actual, predicted)¶

Return type:	`float`

src.utils.result_metrics.calculate_results_classification(actual, predicted)¶

Return type:	`dict`

src.utils.result_metrics.calculate_results_regression(input_df, label)¶

Return type:	`dict`

src.utils.result_metrics.calculate_results_time_series_prediction(actual, predicted)¶

Return type:	`dict`

src.utils.result_metrics.get_auc(actual, scores)¶

Return type:	`float`

src.utils.result_metrics.get_confusion_matrix(actual, predicted)¶

Return type:	`dict`

src.utils.tests_utils module¶

src.utils.tests_utils.create_test_clustering(clustering_type='noCluster', configuration={})¶

Return type:	`Clustering`

src.utils.tests_utils.create_test_encoding(prefix_length=1, padding=False, value_encoding='simpleIndex', add_elapsed_time=False, add_remaining_time=False, add_resources_used=False, add_new_traces=False, add_executed_events=False, task_generation_type='only')¶

Return type:	`Encoding`

src.utils.tests_utils.create_test_hyperparameter_optimizer(hyperoptim_type='hyperopt', performance_metric='acc', max_evals=10)¶

src.utils.tests_utils.create_test_job(split=None, encoding=None, labelling=None, clustering=None, predictive_model=None, job_type='prediction', hyperparameter_optimizer=None)¶

src.utils.tests_utils.create_test_labelling(label_type='next_activity', attribute_name=None, threshold_type='threshold_mean', threshold=0.0)¶

Return type:	`Labelling`

src.utils.tests_utils.create_test_log(log_name='general_example.xes', log_path='cache/log_cache/test_logs/general_example.xes')¶

Return type:	`Log`

src.utils.tests_utils.create_test_predictive_model(predictive_model='classification', prediction_method='randomForest')¶

Return type:	`PredictiveModel`

src.utils.tests_utils.create_test_split(split_type='single', split_ordering_method='sequential', test_size=0.2, original_log=None, train_log=None, test_log=None)¶

src.utils.time_metrics module¶

src.utils.time_metrics.count_on_event_day(trace, date_dict, event_id)¶: Finds the date of event and returns the value from date_dict :param date_dict one of the dicts from log_metrics.py :param event_id Event id :param trace Log trace

src.utils.time_metrics.duration(trace)¶: Calculate the duration of a trace

src.utils.time_metrics.elapsed_time(trace, event)¶: Calculate elapsed time by event in trace

src.utils.time_metrics.elapsed_time_id(trace, event_index)¶: Calculate elapsed time by event index in trace

src.utils.time_metrics.remaining_time(trace, event)¶: Calculate remaining time by event in trace

src.utils.time_metrics.remaining_time_id(trace, event_index)¶: Calculate remaining time by event index in trace

src.utils package¶

Subpackages¶

Submodules¶

src.utils.django_orm module¶

src.utils.event_attributes module¶

src.utils.file_service module¶

src.utils.log_metrics module¶

src.utils.result_metrics module¶

src.utils.tests_utils module¶

src.utils.time_metrics module¶

Module contents¶