REINFORCEMENT LEARNING-BASED HYBRID SPECTRUM RESOURCE ALLOCATION SCHEME FOR THE HIGH LOAD OF URLLC SERVICES