John Posada
elchulito89
AI & ML interests
None yet
Recent Activity
upvoted a paper about 10 hours ago
On Data Engineering for Scaling LLM Terminal Capabilities
liked a dataset 1 day ago
applied-ai-018/Coding
reacted to ajibawa-2023's post with 🔥 1 day ago
PHP-Code-Large
Dataset: https://huggingface.co/datasets/ajibawa-2023/PHP-Code-Large
PHP-Code-Large is a large-scale corpus of PHP source code comprising more than 12 million lines of PHP code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and static program analysis for the PHP ecosystem.
By providing a high-volume, language-specific corpus, PHP-Code-Large enables systematic experimentation in PHP-focused model training, domain adaptation, and downstream code understanding tasks.
PHP-Code-Large addresses the need for a dedicated PHP-only dataset at substantial scale, enabling focused research across backend systems, CMS platforms, APIs, and full-stack PHP environments.
Organizations
None yet