Semih Cayci
Title
Cited by
Year
Budget-constrained bandits over general cost and reward distributions
S Cayci, A Eryilmaz, R Srikant
International Conference on Artificial Intelligence and Statistics. PMLR …, 2020
Cited by 26, year 2020
Convergence of entropy-regularized natural policy gradient with linear function approximation
S Cayci, N He, R Srikant
arXiv preprint arXiv:2106.04096, 2021
Cited by 21, year 2021
Learning to control renewal processes with bandit feedback
S Cayci, A Eryilmaz, R Srikant
Proceedings of the ACM on Measurement and Analysis of Computing Systems 3 (2 …, 2019
Cited by 18, year 2019
Nonbinary polar coding for multilevel modulation
S Cayci, T Koike-Akino, Y Wang
Optical Fiber Communication Conference, W3H.4, 2019
Cited by 15, year 2019
Group-fair online allocation in continuous time
S Cayci, S Gupta, A Eryilmaz
Advances in Neural Information Processing Systems 33, 13750-13761, 2020
Cited by 14, year 2020
Learning for serving deadline-constrained traffic in multi-channel wireless networks
S Cayci, A Eryilmaz
2017 15th International Symposium on Modeling and Optimization in Mobile, Ad …, 2017
Cited by 11, year 2017
Polar code construction for non-binary source alphabets
S Çaycı, O Arıkan, E Arıkan
2012 20th Signal Processing and Communications Applications Conference (SIU …, 2012
Cited by 9, year 2012
Sample Complexity and Overparameterization Bounds for Temporal Difference Learning with Neural Network Approximation
S Cayci, S Satpathi, N He, R Srikant
IEEE Transactions on Automatic Control 68 (May), 2023
Cited by 8*, year 2023
Policy mirror ascent for efficient and independent learning in mean field games
B Yardim, S Cayci, M Geist, N He
International Conference on Machine Learning, 39722-39754, 2023
Cited by 7, year 2023
Finite-time analysis of entropy-regularized neural natural actor-critic algorithm
S Cayci, N He, R Srikant
arXiv preprint arXiv:2206.00833, 2022
Cited by 7, year 2022
A Lyapunov-Based Methodology for Constrained Optimization with Bandit Feedback
S Cayci, Y Zheng, A Eryilmaz
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 3716-3723, 2022
Cited by 5, year 2022
Hardware-efficient quantized polar decoding with optimized lookup table
T Koike-Akino, Y Wang, S Cayci, DS Millar, K Kojima, K Parsons
2019 24th OptoElectronics and Communications Conference (OECC) and 2019 …, 2019
Cited by 5, year 2019
Learning to control partially observed systems with finite memory
S Cayci, N He, R Srikant
arXiv preprint arXiv:2202.09753, 2022
Cited by 4, year 2022
Continuous-time multi-armed bandits with controlled restarts
S Cayci, A Eryilmaz, R Srikant
arXiv preprint arXiv:2007.00081, 2020
Cited by 4, year 2020
On the multi-channel capacity gains of millimeter-wave communication
S Cayci, A Eryilmaz
2016 IEEE Global Communications Conference (GLOBECOM), 1-6, 2016
Cited by 4, year 2016
Lossless polar compression of q-ary sources
S Çayci, O Arikan
2013 IEEE International Symposium on Information Theory, 1132-1136, 2013
Cited by 4, year 2013
Optimal learning for dynamic coding in deadline-constrained multi-channel networks
S Cayci, A Eryilmaz
IEEE/ACM Transactions on Networking 27 (3), 1043-1054, 2019
Cited by 3, year 2019
Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards
S Cayci, A Eryilmaz
Thirty-seventh Conference on Neural Information Processing Systems, 2023
Year 2023
Stateless Mean-Field Games: A Framework for Independent Learning with Large Populations
B Yardim, S Cayci, N He
Sixteenth European Workshop on Reinforcement Learning, 2023
Year 2023