Learning Optimal Resource Allocations In Wireless Systems