Phongsakon Mark Konrad
Future Researcher with an Entrepreneurial Spirit
About
TLDR: Nothing very special here, anti-big data big data guy, a bit delusional.
My path to research has been, like my life, anything but conventional, beginning in Thailand, moving through Germany, living in Hong Kong for a semester, and now continuing in Denmark. I've served in the military, built startups, and do research. Mix all three together and you get a pretty spicy combination. These experiences revealed that my true passion lies not just in building, but in a deeper question: how do machines actually learn?
I'm finishing my BSc. Engineering (Software Engineering) at the University of Southern Denmark. I work as a Research Assistant and Teaching Assistant in the Data and Intelligence Lab, supervised by Associate Professor Serkan Ayvaz.
I'm an anti-big data big data guy. I don't like the scaling law, it's not elegant, and I want to change that. Maybe that's delusional, but sometimes you have to be, right? I'm interested in the search for soul and consciousness in machines. I am one, and if I have it, why not the current models?
Research Interests
Ranked by what keeps me up at night:
- Efficient intelligence beyond scaling: Investigating whether compact architectures can match or surpass large-scale models, challenging the premise that performance is a function of parameter count.
- Adversarial robustness of alignment: Probing the failure modes of safety mechanisms in language models to inform the design of more resilient guardrails.
- Calibrated uncertainty and self-awareness: Developing methods for models to reliably signal when they are uncertain, confabulating, or operating outside their competence.
- Adversarial cognition as a training signal: Exploring whether training models to reason critically and adversarially yields improvements in both performance and trustworthiness.
- Operationalizing intelligence: Constructing rigorous, falsifiable benchmarks that move beyond task performance toward evaluating genuine reasoning capacity.
- Synthetic affect and machine consciousness: Examining whether incorporating analogues of emotion and subjective experience into model architectures influences learning dynamics and downstream performance.
News
- Jan 2026 Back in Denmark after exchange at HKUST.
- Sep 2025 Started exchange semester at HKUST in Hong Kong.
- Sep 2025 Presented at KES 2025 in Osaka, Japan.
- Aug 2025 Top Ten in DMiAI 2025 (Danish National Championship in AI).
- May 2025 My first paper has been accepted by KES 2025! See you in Osaka :)
- May 2025 Participating in MIT Global AI Hackathon 2025
- Apr 2025 Accepted for exchange at HKUST! Studying in Hong Kong Sep-Dec 2025.
- Oct 2024 Top Ten in DMiAI 2024 (Danish National Championship in AI).
- Sep 2024 Started Research Assistant position at SDU under Associate Professor Serkan Ayvaz
- Nov 2023 Won SDU Case Competition
Publications
Abstract
Flood detection using satellite imagery is crucial for environmental monitoring and disaster management, especially in rural and agricultural regions where even minor water inundation can disrupt farmland accessibility and road safety. Sentinel-1 Synthetic Aperture Radar (SAR) imagery offers a robust solution for mapping water under various weather conditions. Although deep learning-based segmentation methods have shown promising results for flood detection, their comparative performance in agricultural landscapes, including small-scale surface water dynamics, remains underexplored. In this study, we introduced a three-class segmentation framework that distinguishes sea, inland water, and land, improving the flood detection accuracy in complex coastal farmland. Ten different deep learning models were evaluated for segmentation using Sentinel-1 VH polarization decibel values. We further investigated anomaly detection via autoencoders and variational autoencoders to track temporal changes in flood-prone areas. The evaluations showed that the DeepLabv3+ and hybrid ResNet-UNet models outperformed the others.
Abstract
Deep learning models are notoriously data-hungry, which presents a fundamental obstacle for specialized medical tasks where annotated data is scarce. We perform a carefully controlled experiment on a clinically important problem: segmenting the layers of the artery wall from only nine annotated histology images. We compare a variety of standard convolutional architectures, pre-trained on a large public histology corpus, against a vision foundation model guided by a systematic and reproducible prompting curriculum.
Abstract
Gastrointestinal (GI) imaging modalities including endoscopy, colonoscopy, and wireless capsule endoscopy provide vital diagnostic information, yet their manual interpretation remains laborious. Although deep learning approaches, particularly convolutional neural networks (CNNs), hybrid architectures, and transformer-based models, have achieved high accuracy in this domain, their clinical adoption is impeded by data limitations and trust concerns. The current study systematically maps algorithmic trends to specific GI imaging techniques, quantifies the relationship between dataset size and model performance to identify data sufficiency thresholds, and evaluates translational enablers such as federated learning for data privacy and explainable AI for clinician trust.
Abstract
Ad hoc dataset search requires matching underspecified natural-language queries against sparse, heterogeneous metadata records, a task where typical keyword or embedding retrieval alone falls short. We present a reference architecture for agentic hybrid retrieval that combines BM25 lexical search with dense-embedding retrieval via reciprocal rank fusion, orchestrated by a large language model controller that repeatedly plans queries, evaluates result sufficiency, and reranks candidates. To reduce the vocabulary mismatch between user intent and provider-authored metadata, we introduce an offline metadata augmentation step in which an LLM generates pseudo-queries and structured pseudo-descriptions for each dataset record.
Abstract
Large language models are increasingly used as software architecture co-pilots, yet no benchmark evaluates their cloud-native architecture knowledge. We present CAKE, 188 expert-validated questions spanning four cognitive levels (recall, analyze, design, implement) and five cloud-native topics. We tested 22 model configurations (0.5B-70B parameters) from four families, using three-run majority voting for MCQ and LLM-as-judge scoring for free-response items. We observe four findings: (1) MCQ accuracy plateaus above 3B parameters, with 15 of 22 configurations exceeding 90%; (2) free-response scores scale steadily across all cognitive levels; (3) the two formats capture different facets of knowledge, as MCQ approaches a ceiling while free-response continues to differentiate models; (4) reasoning augmentation (+think) improves free-response quality, while tool augmentation (+tool) degrades performance for small models.
Abstract
The current AI coding agents select frameworks, scaffold infrastructure, and wire up integrations. These are architectural decisions, yet almost no one reviews them as such. We study the problem from two angles. First, we survey agentic coding tools and identify five mechanisms by which they make implicit architectural choices. The case study shows that changing the prompt alone produces structurally different systems. Second, we analyze prompt-architecture coupling. The way prompt choices in LLM-integrated code dictate what infrastructure the system needs. Six recurring patterns arise from this analysis. Natural-language instructions now appear to shape system structure directly. We call this vibe architecting, and we outline review practices, decision records, and tooling aimed at bringing these hidden decisions under governance.
Abstract
We propose a novel multi-layered AI framework for day-ahead electricity trading in European power markets, focusing on the Nordic region. Our framework combines three complementary components: (i) Machine Learning(ML)/Deep Learning(DL) forecasters modelling prices, load, weather and renewable generation; (ii) Reinforcement Learning (RL) agents choosing trading or bidding strategies based on these forecasts and world states (weather, current price, current load and current renewable generation); (iii) a layer of debating Large Language Models (LLMs) agents that debate about the input and output of previous layers, integrate textual news and produce final recommendation via an LLM judge. This design enables the system to 'see' through the forecasting models, 'act' through RL agent and 'explain' through language-based reasoning. We use publicly available data from Nord Pool, ENTSO-E and Energinet to evaluate each layer individually and in combination to improve both trading performance and interpretability.
Abstract
Hyperspectral imaging enables non-destructive assessment of fruit ripeness and firmness, yet most published methods rely on deep learning architectures that are difficult to deploy in commercial sorting environments. This study benchmarks 19 traditional machine learning algorithms on dual-task prediction (ripeness classification and firmness estimation) using the DeepHS Fruit dataset with 1,120 engineered VIS-NIR spectral features across five fruit species. Through a 6-phase experimental design spanning 190 preprocessing configurations, 3,800 Bayesian optimization trials, and 10-fold cross-validation, we find that preprocessing strategy consistently matters more than algorithm choice: stratified resplit balancing and spectral feature engineering together account for the largest performance gains, while PCA dimensionality reduction degrades accuracy by 2.52 percentage points on average by discarding discriminative spectral information. The best single model (ExtraTrees with stratified resplit, no PCA) achieves 75.00% overall accuracy, surpassing Fruit-HSNet's reported benchmark of 70.73% while training in 0.31 seconds on consumer hardware. An RGB-only comparison reaches 69.57% accuracy (93% of full-spectrum performance), though this represents an upper bound from bands extracted from hyperspectral cubes rather than true RGB cameras. Explainable AI analysis via SHAP, LIME, and permutation importance reveals concentrated feature importance in visible regions associated with pigment changes and near-infrared regions related to tissue structure and water content, linking model decisions to known indicators of fruit maturation. These results demonstrate that carefully preprocessed traditional ML pipelines can match or exceed published benchmarks for spectral fruit quality tasks, offering a practical, interpretable, and resource-efficient alternative for commercial deployment.
Education
- COMP4211 Machine Learning
- COMP4471 Deep Learning in Computer Vision
- COMP4901B Large Language Models
- COMP4901Z Reinforcement Learning
- COMP6411D Data Visualisation (Postgraduate)
- A practical, project-centric curriculum where theoretical knowledge is applied in mandatory, semester-long team projects
- These projects involve developing complex, data-intensive software systems in domains like IoT and AI, often in collaboration with industry partners
Research Experience
- Developing end-to-end machine learning projects, from co-initiating concepts to building scalable ML pipelines and executing experiments
- Co-authoring academic papers for publication, contributing to manuscript drafting, literature reviews, and the revision process
- Assisting in the drafting and preparation of grant proposals to secure research funding
- Managing the full research data lifecycle, including multimodal data collection, processing, and documentation
- Supporting course delivery through syllabus planning, lab exercise development, and direct student instruction
- Designing hands-on lab assignments and contributing to curriculum development
Professional Experience
- Led the end-to-end development of the company's websites and core web application
- Implemented the 'Shape Up' product development framework to streamline technical execution
- Applied insights from previous startup experience to optimize the development lifecycle and avoid common pitfalls
- Co-founded the company and led development of stabil.ai, an innovative AI-powered mobile app for personalized powerlifting training
- Engineered intelligent algorithms to personalize training plans using individual data (MRV/MEV) and dynamic real-time feedback
- Oversaw the full product lifecycle, from UX/UI concept and design to full-stack implementation, adhering to lean startup principles
- Led a small HR team responsible for the administration of over 600 soldiers
- Streamlined administrative processes and document workflows to improve efficiency in a high-stakes naval command environment
Skills
- Programming Languages
- Python, JavaScript, SQL
- ML Libraries
- PyTorch, Hugging Face, Scikit-learn, Pandas, NumPy
- Data Visualization
- Matplotlib, Seaborn
- ML Engineering
- Data & Feature Engineering, Hyperparameter Tuning (Optuna, Weights & Biases Sweeps), Experiment Tracking (Weights & Biases)
- Web Development
- React, React Native, Node.js, Express, HTML/CSS, Streamlit
- Tools & Platforms
- Git, Docker, Google Cloud Platform, Vercel
- Methodologies
- Scrum, Shape Up
Honors and Awards
- Achieved top-tier placement for a second consecutive year in the national competition organized by the Danish Society for Artificial Intelligence
- Achieved top-tier placement in a prestigious national competition for students and professionals, organized by the Danish Society for Artificial Intelligence
- Awarded 1st place out of numerous teams in an intensive 48-hour competition focused on sustainability
- An Event with cases from leading danish companies such as Danfoss or Linak
- Developed the winning solution for a real-world business case presented by the company WE-USE
- Formally commended for setting a benchmark in dedication, officially designating him as an exemplar of professionalism for all enlisted personnel
- Recognized for demonstrating an exceptionally optimistic and creative work ethic, proactively seeking out new responsibilities beyond his core duties, and thereby making a direct contribution to the unit's operational readiness and mission success
- Awarded a significant and rarely-issued financial bonus for sustained, far above-average performance that consistently exceeded all expectations
- Singled out for demonstrating an impeccable work morale and a profound sense of duty, trusted to independently manage complex tasks to the highest quality standards. His high social competence was noted as a key factor in improving the workload of superiors and fostering a positive and effective operational climate
Certifications
Professional certification in machine learning fundamentals and applications
Certification in C# programming fundamentals
Mobile development with React Native
Professional certification in UX design principles and methodologies
Comprehensive full-stack development certification
Professional certification in real estate loan brokerage
Get in Touch
I'm always open to talk about research ideas, potential collaborations, or entrepreneurial projects. If something on this page caught your attention, just reach out.