Fine Tuning DeepSeek and Llama Large Language Models with LoRA

dc.contributor.author Uluirmak, Bugra Alperen
dc.contributor.author Kurban, Rifat
dc.date.accessioned 2025-10-20T16:27:57Z
dc.date.available 2025-10-20T16:27:57Z
dc.date.issued 2025
dc.description.abstract In this paper, Low-Rank Adaptation (LoRA) finetuning of two different large language models (DeepSeek R1 Distill 8B and Llama3.1 8B) was performed using the Turkish dataset. Training was performed on Google Colab using A100 40 GB GPU, while the testing phase was carried out on Runpod using L4 24 GB GPU. The 64.6 thousand row dataset was transformed into question-answer pairs from the fields of agriculture, education, law and sustainability. In the testing phase, 40 test questions were asked for each model via Ollama web UI and the results were supported with graphs and detailed tables. It was observed that the performance of the existing language models improved with the fine-tuning method. en_US
dc.identifier.doi 10.1109/SIU66497.2025.11112387
dc.identifier.isbn 9798331566562
dc.identifier.isbn 9798331566555
dc.identifier.issn 2165-0608
dc.identifier.scopus 2-s2.0-105015366215
dc.identifier.uri https://doi.org/10.1109/SIU66497.2025.11112387
dc.language.iso tr en_US
dc.publisher IEEE en_US
dc.relation.ispartof 33rd Conference on Signal Processing and Communications Applications-SIU-Annual -- Jun 25-28, 2025 -- Istanbul, Turkiye en_US
dc.relation.ispartofseries Signal Processing and Communications Applications Conference
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Large Language Models en_US
dc.subject Fine-Tuning en_US
dc.subject LoRA en_US
dc.subject Turkish LLM Dataset en_US
dc.title Fine Tuning DeepSeek and Llama Large Language Models with LoRA en_US
dc.title.alternative DeepSeek ve Llama Büyük Dil Modellerinin LoRa ile İnce Ayarı
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.wosid Kurban, Rifat/B-1175-2012
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül Üniversitesi en_US
gdc.description.departmenttemp [Uluirmak, Bugra Alperen; Kurban, Rifat] Abdullah Gul Univ, Dept Comp Engn, Kayseri, Turkiye en_US
gdc.description.endpage 4
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 1
gdc.description.woscitationindex Conference Proceedings Citation Index - Science
gdc.description.wosquality N/A
gdc.identifier.openalex W4413467646
gdc.identifier.wos WOS:001575462500338
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 0.0
gdc.oaire.influence 2.5349236E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 2.8669784E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 4.81974515
gdc.openalex.normalizedpercentile 0.94
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 0
gdc.plumx.mendeley 2
gdc.plumx.scopuscites 1
gdc.scopus.citedcount 1
gdc.virtual.author Kurban, Rifat
gdc.wos.citedcount 1
relation.isAuthorOfPublication f55f9796-680f-4dd5-9c98-e43d0ffee812
relation.isAuthorOfPublication.latestForDiscovery f55f9796-680f-4dd5-9c98-e43d0ffee812
relation.isOrgUnitOfPublication 52f507ab-f278-4a1f-824c-44da2a86bd51
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication ef13a800-4c99-4124-81e0-3e25b33c0c2b
relation.isOrgUnitOfPublication.latestForDiscovery 52f507ab-f278-4a1f-824c-44da2a86bd51

Files