Evolve genai_demo_event to working POC with real embeddings

Transform genai_demo_event.cpp from skeleton to working POC that: - Integrates with real llama-server on port 8013 for embeddings - Uses shared memory (passing pointers, not copying data) - Supports single or multiple documents per request - Properly transfers memory ownership between GenAI and client Architecture changes: - Document struct: passed by pointer from client to GenAI - RequestHeader: includes document_count and operation type - ResponseHeader: includes embedding_size and embedding_ptr - EmbeddingResult: allocated by GenAI, owned by client after response libcurl integration: - HTTP POST to llama-server embedding API - JSON parsing of embedding responses - Error handling for network/API failures Key features: - Clients wait for response before sending next request - (ensures document pointers remain valid) - GenAI workers handle multiple concurrent requests - Embedding dimension: 1023 floats (from llama-server) - Processing time: 30-250ms (real API latency) Results: - 5 clients completed 9 embedding requests - All embeddings successfully retrieved - Zero-copy data transfer via shared memory pointers - Early termination when all work completed Future-ready: - Operation enum (OP_EMBEDDING, OP_COMPLETION, OP_RAG) - Extensible for other GenAI operations - Document count supports batch processing
4 months ago · 2c0f3a2e64
parent 012142eeed
commit 2c0f3a2e64
3 changed files with 476 additions and 318 deletions
--- a/genai_prototype/Makefile
+++ b/genai_prototype/Makefile
@ -3,7 +3,9 @@

 CXX = g++
 CXXFLAGS = -std=c++17 -Wall -Wextra -O2 -g
-LDFLAGS = -lpthread
+LDFLAGS = -lpthread -lcurl
+CURL_CFLAGS = $(shell curl-config --cflags)
+CURL_LDFLAGS = $(shell curl-config --libs)

 # Target executables
 TARGET_THREAD = genai_demo
@ -29,7 +31,7 @@ genai_demo: genai_demo.o

 genai_demo_event: genai_demo_event.o
 	@echo "Linking genai_demo_event..."
-	$(CXX) genai_demo_event.o $(LDFLAGS) -o genai_demo_event
+	$(CXX) genai_demo_event.o $(CURL_LDFLAGS) $(LDFLAGS) -o genai_demo_event
 	@echo "Build complete: genai_demo_event"

 # Compile source files
@ -39,7 +41,7 @@ genai_demo.o: genai_demo.cpp

 genai_demo_event.o: genai_demo_event.cpp
 	@echo "Compiling $<..."
-	$(CXX) $(CXXFLAGS) -c $< -o $@
+	$(CXX) $(CXXFLAGS) $(CURL_CFLAGS) -c $< -o $@

 # Run the demos
 run: $(TARGET_THREAD)
--- a/genai_prototype/genai_demo_event
+++ b/genai_prototype/genai_demo_event
--- a/genai_prototype/genai_demo_event.cpp
+++ b/genai_prototype/genai_demo_event.cpp