Scheduling Jobs
Optimizing your workflow involves the strategic scheduling of various jobs. Below is a guide on how to efficiently handle data uploads, training, and predictions.
Data Upload
New approved registrations should be uploaded daily. This practice keeps your dataset current and maximizes the relevance of your predictions.
After each upload, wait for the validation job to finish, confirming that the data is ready for the next step.
Initiating Training
Full Retrain: Conduct a full training with rebuildModels=True
once per week. This comprehensive approach accounts for any significant changes or trends in the data.
Update Models: Initiate training with rebuildModels=False
if there is any new approved registrations, e.g. every evening. This configuration allows for the inclusion of new data points without undergoing the full training process, optimizing model freshness and performance.
Prediction Strategy
Real-Time Prediction: For immediate feedback, you can ask for real-time prediction for every new registration.
Batch Predictions: Ask for batch predictions on all unapproved registrations nightly. This approach gives you a comprehensive overview, capturing trends and insights from the aggregated results.