This program was made as an assignment to submit for an internship as a test. The code will be explained as detailed as possible. Any issues in the code or its explination are welcome in the Issues section.
The program works on the basis of generated data. In order to achieve this, a random data generator is made such that the data generated is close to what is expected from a good call service provider. Due to this data generation takes time and I would highly recommend that in order to not use parameters that are too large as it would take alot of time to generate it's data.
The accuracy of the program is always questionable. I guess we'll never know until real world data is used and verified.
You can skip the explination and directly go to the user manual if you want but I would recommend reading this once.
This program is used to predict the time a caller would have to wait before the request would be answered. It follows a simple set of steps in order to do so.
- Generate data.
- Calculate the response and abandonment times
- Create data based on number of requests
- Correcting the calculations and predition
Note: It would help to have the code open in another window
If you are looking for a machine learning solution to this problem, you may stop reading this if you feel my justification for not using it is not convincing. If you aren't you can just skip ahead.
I have not used machine learning partly as I don't know any (Do mind its not that I don't want to learn) but mostly because of the following reasons:
- The mammoth ammout of data required and the 1000 new parameters I will have to create to train a model that would give me probably no proper conclusions due to the highly random nature of human beings.
- I am using pseduo random numbers so the model would just find the pattern that generates these random numbers which would be a waste of effort.
- The number ranges of random numbers generated are based on statistics of good waiting times thus apart from correction of the phases that take too long out of the blue this should be able to give resonably good predictions
- For good predictiont the model would have to be very complicated.
- This isn't enough time for me to master machine learning and I dont want to just slap in some statistical model without knowing what it is, and all that I think I know about machine learning is wrong and this would have been much simpler had I used it.
Maybe for another time this could be possible provided the resources from real world to work on it.
Now if you aren't convinced you may leave.
The classes are fairly straight forward.
_agents
is used to store the refernce to the objects created of Agent class.
agent_no
Stores what is equivalent to and id number given to an agent
issue_assigned
Stores None
, when no issue is assigned to it and stores refernce to and object of type Issue
if an issue is assigned to an agent.
Agent.
get_intances(cls)
is a class method that is used to retive _agents
_issues
is used to store the refernce to the objects created of Issue class.
arrival_time
Time at which the issue arrives.
result
Stores 'r' or 'a' indicating responded or abandoned. If issue is responded to response_time
and completion_time
are set. If issue was abandoned abandoned_time
is set.
response_time
Time at which the issue was request was accepted by an agent.
completion_time
Time at which the issue was closed by agent.
abandoned_time
Time at which Issue was abandoned.
avg_response_time(cls)
Class method used to generate the average time taken to respond by all the objects created.
avg_abandonment_time(cls)
Class method used to generate the average time before which a user would leave the request as it was not responded to.
clear_objs(cls)
Class method that is used to delete all instances of class.
push_to_json(cls, file)
Class method used to move all current class methods onto a json file. Used to debug the program and also see the data generated. Note: The json dumped into file is not getting formated for some reason. Uncomment the print statement to print formatted json on the console or if you know the mistake let me know in the Issues section.
** Click on the heading to take yourself to the file containting the code being explained.
Most of the Data is generated using Data_generators.stats_generator.generate_data()
. It works as a control unit calling other functions of Data_generators
. The data generated is for a maximum of 7hrs or in the case of determining waiting time the number of requests.
This function is used for 2 operations, to generate the data upon which perdictions are made and to generate the data before the request whoose waiting time is to be predicted.
The variables h = 9, m = 0, s = 5
are indicating the service starts at 09:00:05. This would not be required really in the real world its just to generate the data. queue
is used to hold all the objects that have arrived but not been assigned to an Agent.
for i in range(no_of_agents):
agent = create_agent(i)
h, m, s = make_time(h, m, s, no_of_agents)
arrival_time = str(f'{h}:{m}:{s}')
req_no += 1
generated_issue = Issue.Issue(arrival_t=arrival_time)
agent.issue_assigned = generated_issue
fill_up_object(agent)
The agents are created using create_agents()
one at a time and the issues are generated and assigned to the created agents. make_time()
(read_more_here) is used to generate time in incrementals specific to the workload meant for given number of agents. The data is filled by fill_up_object()
(more_here).
Making sure that issues generated are below working hours, the issues are created and appended to the queue, the agents are checked for accomodating issues in the queue using agent_queue_handler()
and the cycle continues till the last element is generated.
if no_of_req is not None and no_of_req == req_no:
break
The above snipped is used when a finite number of data must be generated .ie. no_of_req is passed to function call. This is done when the function is used in prediction phase.
The average response time and the average abandonment time is then calculated for the dataset using the respective class functions.
The rest of the function belongs to predictive side of this function and shall be explained below.
The calculations are performed by the 2 Issue class methods avg_response_time()
and avg_abandonment_time()
When the function is called much like most averge functions, the total_reslution_time
is calculated as sum of differences between response time and arrival time. It is then divided by the total no of objects that were taken into consideration .ie. objects with 'r' as result and then returned.
Similar to calulating the average response time, objects that are abandoned are selected using result property ('a') and then summed together and divided by the number of abandonments to give the average abandonment time.
This part involves calling the generate_data()
function with the parameters no_of_req
,avg_res_time
, and avg_aban_time
that were generated from the functions previous execution. The data generation part is the same as explained already.
if no_of_req is not None and no_of_req == req_no:
break
Here the above snippet comes into action limiting the number of requests generated to no_of_req
This part involves making corrections by the new trends and predicting the value.
if no_of_req == req_no:
ele_in_queue = len(queue)-1
avg_response_time = (avg_response_time + avg_res_time) // 2
multiplier = ele_in_queue // no_of_agents + 1
waiting_time = avg_response_time*multiplier
else:
waiting_time = 0
Here first the new value of response time (avg_response_time
) and the recieved value (avg_res_time
) are averaged correcting for the new trend in data. The elements in the queue determine which cycle does a person belong.
ex: There are 18 elements in the queue and 10 agents then,
multiplier = 18//10 + 1 = 1+1 = 2
This means that the person is in the second cycle of the queue and the waiting time would be,
avg_response_time * 2.
So if the average response time is 30seconds, the prediction for this person would be a wait time of 60s or 1min.