We studied the sample complexity of community state inference, in which a $ K $-sparse latent state vector $ {\bf x}\in\mathbb{F}_q^N $ over a known community partition is to be recovered from pooled observations $ {\bf y} = {\bf A}{\bf x} $ over the finite field $ \mathbb{F}_q $. The pooling matrix $ {\bf A} $ has a constant row weight of $ d $, with modeling pools formed as linear combinations of exactly $ d $ members. We derived necessary and sufficient conditions on the number of pooled observations $ M $. Let $ \alpha_t(d) $ denote the probability that a row of $ {\bf A} $ misses a fixed set of size $ t $. The lower bound, obtained from Fano's inequality, is governed by $ \alpha_K(d) $; the upper bound, obtained under maximum a posteriori (MAP) decoding, is governed by $ \alpha_{2K}(d) $, thus reflecting the worst-case overlap between two candidate supports of total size $ 2K $. Under the sparse-regime approximation $ \alpha_{2K}(d)\approx e^{-2Kd/N} $, we identified the asymptotic saturation point $ d^{*} = (N/(2K))\ln q $ as the solution of $ \alpha_{2K}(d) = 1/q $, at which the two bounds match to order $ \Theta(K\log_q(N/K)) $ with a multiplicative gap bounded by a constant $ C(q) $ that satisfies $ C(q)\to 2 $ as $ q\to\infty $. The analysis was restricted to the sparse signal regime $ K\to\infty $, $ K/N\to 0 $, $ \log q = o(K) $, and assumes noiseless observations; no practical decoder is proposed.
Citation: Jin-Taek Seong. Finite-field pooling for community state inference: sample complexity bounds at the saturation point[J]. AIMS Mathematics, 2026, 11(6): 18502-18524. doi: 10.3934/math.2026752
We studied the sample complexity of community state inference, in which a $ K $-sparse latent state vector $ {\bf x}\in\mathbb{F}_q^N $ over a known community partition is to be recovered from pooled observations $ {\bf y} = {\bf A}{\bf x} $ over the finite field $ \mathbb{F}_q $. The pooling matrix $ {\bf A} $ has a constant row weight of $ d $, with modeling pools formed as linear combinations of exactly $ d $ members. We derived necessary and sufficient conditions on the number of pooled observations $ M $. Let $ \alpha_t(d) $ denote the probability that a row of $ {\bf A} $ misses a fixed set of size $ t $. The lower bound, obtained from Fano's inequality, is governed by $ \alpha_K(d) $; the upper bound, obtained under maximum a posteriori (MAP) decoding, is governed by $ \alpha_{2K}(d) $, thus reflecting the worst-case overlap between two candidate supports of total size $ 2K $. Under the sparse-regime approximation $ \alpha_{2K}(d)\approx e^{-2Kd/N} $, we identified the asymptotic saturation point $ d^{*} = (N/(2K))\ln q $ as the solution of $ \alpha_{2K}(d) = 1/q $, at which the two bounds match to order $ \Theta(K\log_q(N/K)) $ with a multiplicative gap bounded by a constant $ C(q) $ that satisfies $ C(q)\to 2 $ as $ q\to\infty $. The analysis was restricted to the sparse signal regime $ K\to\infty $, $ K/N\to 0 $, $ \log q = o(K) $, and assumes noiseless observations; no practical decoder is proposed.
| [1] |
M. Aldridge, O. Johnson, J. Scarlett, Group testing: An information theory perspective, Found. Trends Commun., 15 (2019), 196–392. https://doi.org/10.1561/0100000099 doi: 10.1561/0100000099
|
| [2] | P. Nikolopoulos, S. R. Srinivasavaradhan, T. Guo, C. Fragouli, S. Diggavi, Group testing for overlapping communities, ICC 2021–IEEE International Conference on Communications, Montreal, QC, Canada, 2021, 1–7. https://doi.org/10.1109/ICC42927.2021.9500791 |
| [3] |
S. Ahn, W.-N. Chen, A. Özgür, Adaptive group testing on networks with community structure: The stochastic block model, IEEE T. Inform. Theory, 69 (2023), 4758–4776, https://doi.org/10.1109/TIT.2023.3247520 doi: 10.1109/TIT.2023.3247520
|
| [4] |
S. Fortunato, Community detection in graphs, Physics Reports, 486 (2010), 75–174, https://doi.org/10.1016/j.physrep.2009.11.002 doi: 10.1016/j.physrep.2009.11.002
|
| [5] |
B. Karrer, M. E. J. Newman, Stochastic blockmodels and community structure in networks, Phys. Rev. E, 83 (2011), 016107, https://doi.org/10.1103/PhysRevE.83.016107 doi: 10.1103/PhysRevE.83.016107
|
| [6] | E. Abbe, Community detection and stochastic block models: recent developments, J. Mach. Learn. Res., 18 (2018), 6446–6531. |
| [7] |
D. L. Donoho, Compressed sensing, IEEE T. Inform. Theory, 52 (2006), 1289–1306. https://doi.org/10.1109/TIT.2006.871582 doi: 10.1109/TIT.2006.871582
|
| [8] |
E. J. Candes, J. Romberg, T. Tao, Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information, IEEE T. Inform. Theory, 52 (2006), 489–509. https://doi.org/10.1109/TIT.2005.862083 doi: 10.1109/TIT.2005.862083
|
| [9] | D.-Z. Du, F. K. Hwang, Combinatorial group testing and its applications, 2 Eds., Singapore: World Scientific, 2000. https://doi.org/10.1142/4252 |
| [10] |
J.-T. Seong, H.-N. Lee, Necessary and sufficient conditions for recovery of sparse signals over finite fields, IEEE Commun. Lett., 17 (2013), 1976–1979. https://doi.org/10.1109/LCOMM.2013.090313.130753 doi: 10.1109/LCOMM.2013.090313.130753
|
| [11] | A. K. Das, S. Vishwanath, On finite-alphabet compressive sensing, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, Canada, 2013, 5890–5894. https://doi.org/10.1109/ICASSP.2013.6638794 |
| [12] |
R. Ahlswede, N. Cai, S.-Y. R. Li, R. W. Yeung, Network information flow, IEEE T. Inform. Theory, 46 (2000), 1204–1216. https://doi.org/10.1109/18.850663 doi: 10.1109/18.850663
|
| [13] |
S.-Y. R. Li, R. W. Yeung, N. Cai, Linear network coding, IEEE T. Inform. Theory, 49 (2003), 371–381. https://doi.org/10.1109/TIT.2002.807285 doi: 10.1109/TIT.2002.807285
|
| [14] |
G. K. Atia, V. Saligrama, Boolean compressed sensing and noisy group testing, IEEE T. Inform. Theory, 58 (2012), 1880–1901. https://doi.org/10.1109/TIT.2011.2178156 doi: 10.1109/TIT.2011.2178156
|
| [15] |
A. Mazumdar, Nonadaptive group testing with random set of defectives, IEEE T. Inform. Theory, 62 (2016), 7522–7531. https://doi.org/10.1109/TIT.2016.2613870 doi: 10.1109/TIT.2016.2613870
|
| [16] | T. M. Cover, J. A. Thomas, Elements of information theory, Hoboken: John Wiley & Sons, 2005. https://doi.org/10.1002/047174882X |
| [17] |
M. E. J. Newman, Modularity and community structure in networks, Proc. Natl. Acad. Sci. U.S.A., 103 (2006), 8577–8582. https://doi.org/10.1073/pnas.0601602103 doi: 10.1073/pnas.0601602103
|
| [18] |
V. A. Traag, L. Waltman, N. J. van Eck, From louvain to leiden: Guaranteeing well-connected communities, Sci. Rep., 9 (2019), 5233. https://doi.org/10.1038/s41598-019-41695-z doi: 10.1038/s41598-019-41695-z
|
| [19] | Z. X. Zhou, A. A. Amini, Analysis of spectral clustering algorithms for community detection: the general bipartite setting, J. Mach. Learn. Res., 20 (2019), 1774–1820. |
| [20] | H. Qing, J. L. Wang, An improved spectral clustering method for community detection under the degree-corrected stochastic blockmodel, 2010, arXiv: 2011.06374. https://doi.org/10.48550/arXiv.2011.06374 |
| [21] |
X. Y. Lu, B. K. Szymanski, A regularized stochastic block model for robust community detection in complex networks, Sci. Rep., 9 (2019), 13247. https://doi.org/10.1038/s41598-019-49580-5 doi: 10.1038/s41598-019-49580-5
|
| [22] | V. Cohen-Addad, F. Mallmann-Trenn, D. Saulpic, Community recovery in the degree-heterogeneous stochastic block model, Proceedings of Thirty Fifth Conference on Learning Theory, PMLR, 2022, 1662–1692. |
| [23] | M.-J. Lai, D. Mckenzie, A compressive sensing approach to community detection with applications, 2018, arXiv: 1708.09477. https://doi.org/10.48550/arXiv.1708.09477 |
| [24] |
T. J. Wang, J. B. Tan, W. B. Ding, Y. R. Zhang, F. Yang, J. Song, et al., Intercommunity detection scheme for social internet of things: compressive sensing over graphs approach, IEEE Internet Things, 5 (2018), 4550–4557. https://doi.org/10.1109/JIOT.2018.2837048 doi: 10.1109/JIOT.2018.2837048
|
| [25] |
P. Nikolopoulos, S. R. Srinivasavaradhan, T. Guo, C. Fragouli, S. N. Diggavi, Community-aware group testing, IEEE T. Inform. Theory, 69 (2023), 4361–4383. https://doi.org/10.1109/TIT.2023.3250119 doi: 10.1109/TIT.2023.3250119
|
| [26] | R. M. Roth, Introduction to coding theory, Cambridge: Cambridge University Press, 2006, https://doi.org/10.1017/CBO9780511808968 |
| [27] |
D. H. Wiedemann, Solving sparse linear equations over finite fields, IEEE T. Inform. Theory, 32 (1986), 54–62. https://doi.org/10.1109/TIT.1986.1057137 doi: 10.1109/TIT.1986.1057137
|
| [28] |
M. C. Davey, D. J. C. MacKay, Low-density parity check codes over ${\rm{GF}}(q)$, IEEE Commun. Lett., 2 (1998), 165–167. https://doi.org/10.1109/4234.681360 doi: 10.1109/4234.681360
|
| [29] |
D. Declercq, M. Fossorier, Decoding algorithms for nonbinary LDPC codes over GF$(q)$, IEEE T. Commun., 55 (2007), 633–643. https://doi.org/10.1109/TCOMM.2007.894088 doi: 10.1109/TCOMM.2007.894088
|
| [30] | E. R. Berlekamp, Algebraic coding theory, New York: McGraw-Hill, 1968. |
| [31] | J. Stern, A method for finding codewords of small weight, In: Coding theory and applications, Berlin: Springer, 1989,106–113. https://doi.org/10.1007/BFb0019850 |
| [32] |
E. R. Berlekamp, R. J. McEliece, H. C. A. van Tilborg, On the inherent intractability of certain coding problems, IEEE T. Inform. Theory, 24 (1978), 384–386. https://doi.org/10.1109/TIT.1978.1055873 doi: 10.1109/TIT.1978.1055873
|
| [33] | C. Peters, Information-set decoding for linear codes over $\mathbb{F}_q$, In: Post-quantum cryptography, Berlin: Springer, 2010, 81–94. https://doi.org/10.1007/978-3-642-12929-2_7 |
| [34] |
D. L. Donoho, A. Maleki, A. Montanari, Message-passing algorithms for compressed sensing, Proc. Natl. Acad. Sci. U.S.A., 106 (2009), 18914–18919. https://doi.org/10.1073/pnas.0909892106 doi: 10.1073/pnas.0909892106
|
| [35] | S. Rangan, Generalized approximate message passing for estimation with random linear mixing, 2011 IEEE International Symposium on Information Theory Proceedings, St. Petersburg, Russia, 2011, 2168–2172. https://doi.org/10.1109/ISIT.2011.6033942 |
| [36] | A. Coja-Oghlan, O. Gebhard, M. Hahn-Klimroth, P. Loick, Optimal group testing, In: Proceedings of Thirty Third Conference on Learning Theory Learning Research, PMLR, 2020, 1374–1388. |
| [37] | M. Abramowitz, I. A. Stegun, Handbook of mathematical functions with formulas, graphs, and mathematical tables, Washington: National Bureau of Standards, 1964. |
| [38] | J. Scarlett, V. Cevher, Phase transitions in group testing, The Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, Arlington, Virginia, 2016, 40–53. |