Exetools  

Go Back   Exetools > General > General Discussion

Notices

Reply
 
Thread Tools Display Modes
  #1  
Old 01-29-2025, 08:22
chants chants is offline
VIP
 
Join Date: Jul 2016
Posts: 819
Rept. Given: 46
Rept. Rcvd 50 Times in 31 Posts
Thanks Given: 730
Thanks Rcvd at 1,136 Times in 527 Posts
chants Reputation: 51
Running DeepSeek R1 locally

DeepSeek has its flagship V3 model equivalent to GPT4 and it's reasoning model R1 freely accessible:
Quote:
https://www.deepseek.com/
AI training for 5.6 million USD exceeding the quality of 100 mil to 1 bil USD. Inference of high quality is within reach of your own local environment where your data stays private. I've found their models better at reasoning than OpenAIs significantly. It's quite exciting and I'm surprised noone has brought the topic up yet, given the large amount of use cases for reverse engineering and the very low cost.

If you have a GPU recommend OLlama which works on Windows, Linux and Mac (can also rin Facebook/Meta's Llama models):
Quote:
https://ollama.com/
. Can choose from the models listed here:
Quote:
https://ollama.com/library/deepseek-r1
. 8b is pretty lightweight but if you have a recent Nvidia GPU with a lot of RAM why not go for 32b.

For a frontend chat interface, I recommend Chatbot AI:
Quote:
https://chatboxai.app
Even better the V3 and R1 model are open source and you can do your own model finishing if you have the resources.

R1:
Quote:
https://github.com/deepseek-ai/DeepSeek-R1
V3:
Quote:
https://github.com/deepseek-ai/DeepSeek-V3
Reply With Quote
The Following 4 Users Say Thank You to chants For This Useful Post:
blue_devil (01-29-2025), DARKER (01-29-2025), Doit (01-30-2025), wx69wx2023 (01-31-2025)
  #2  
Old 01-29-2025, 15:23
blue_devil's Avatar
blue_devil blue_devil is offline
Family
 
Join Date: Dec 2011
Location: Observable Universe
Posts: 437
Rept. Given: 93
Rept. Rcvd 60 Times in 33 Posts
Thanks Given: 474
Thanks Rcvd at 703 Times in 229 Posts
blue_devil Reputation: 60
Are there any reverse engineering (especially for decompilation), specific models?
Reply With Quote
  #3  
Old 01-29-2025, 16:09
deepzero's Avatar
deepzero deepzero is offline
VIP
 
Join Date: Mar 2010
Location: Germany
Posts: 307
Rept. Given: 114
Rept. Rcvd 64 Times in 42 Posts
Thanks Given: 186
Thanks Rcvd at 221 Times in 94 Posts
deepzero Reputation: 64
This makes DIGITS even more interesting to me, considering buying one. (But it will only be out in may). https://www.nvidia.com/en-eu/project-digits/

What I dont get though is how 1PFLOG in FP4 is a selling point when the regular GTX5090 has like 600 TFLOP in FP32? Am I missing something or is the 128GB integrated RAM the selling factor?
Reply With Quote
  #4  
Old 01-29-2025, 19:18
sendersu sendersu is offline
VIP
 
Join Date: Oct 2010
Posts: 1,293
Rept. Given: 335
Rept. Rcvd 236 Times in 126 Posts
Thanks Given: 325
Thanks Rcvd at 631 Times in 349 Posts
sendersu Reputation: 200-299 sendersu Reputation: 200-299 sendersu Reputation: 200-299
There is even more powerfull ML model then DeepSeek - Alibaba’s Qwen2.5-Max
Reply With Quote
  #5  
Old 01-29-2025, 20:10
DARKER DARKER is offline
VIP
 
Join Date: Jul 2004
Location: Somewhere Over the Rainbow
Posts: 507
Rept. Given: 15
Rept. Rcvd 121 Times in 53 Posts
Thanks Given: 18
Thanks Rcvd at 941 Times in 235 Posts
DARKER Reputation: 100-199 DARKER Reputation: 100-199
DeepSeek censorship:
Code:
https://www.theguardian.com/technology/2025/jan/28/we-tried-out-deepseek-it-works-well-until-we-asked-it-about-tiananmen-square-and-taiwan
Reply With Quote
The Following 2 Users Say Thank You to DARKER For This Useful Post:
blue_devil (01-30-2025), Gyrus (01-30-2025)
  #6  
Old 01-29-2025, 22:33
chants chants is offline
VIP
 
Join Date: Jul 2016
Posts: 819
Rept. Given: 46
Rept. Rcvd 50 Times in 31 Posts
Thanks Given: 730
Thanks Rcvd at 1,136 Times in 527 Posts
chants Reputation: 51
It would be nice to train an RE model. The good news now is that training is being shown to be feasible possibly on an academic grant level budget. Someone should train a proper open source RE model at some point.

1PFLOP FP4 is a marketing gimmick maybe, that amount of RAM is a big plus tho. The new DeepSeek models use FP8 and have shown it's reliable for training, a good breakthrough. Sounds good enough to run good size models at moderate load.

Alibaba sounds interesting haven't heard much about it.

By the way DeepSeek censorship from demos I saw is on the website but at least running R1 locally, it seems to not be censoring those things much or at all.
Reply With Quote
  #7  
Old 02-03-2025, 12:43
chants chants is offline
VIP
 
Join Date: Jul 2016
Posts: 819
Rept. Given: 46
Rept. Rcvd 50 Times in 31 Posts
Thanks Given: 730
Thanks Rcvd at 1,136 Times in 527 Posts
chants Reputation: 51
Censorship update: it appears if you download models and run locally there is no censorship.

Censoring is definitely done on their public website if you run your queries on their hardware. Here is how it works:
when you send your query it goes into the real model avatars thinking or generating a response.

At the same time, it is sent to a classifier model that is far cheaper, faster and specific. This classifier is trained with a prompt similar to "is the following '<prompt>' related to the following list of sensitive topics". If it returns yes, the main query is immediately aborted and a message displayed. Sometimes you can see it start to think and cut off a few sentences into it's deepthink. Other time it cuts off so fast that it appears nothing yet was emitted.

No problem, run it locally, you won't deal with the censorship classifier. It would be interesting though to have a list of the sensitive topics but that is probably kept secretly and securely.

Update: According to this article I am mistaken and the censorship concerns mentioned are legit:
Quote:
https://techcrunch.com/2025/02/03/no-deepseek-isnt-uncensored-if-you-run-it-locally/

Last edited by chants; 02-04-2025 at 07:04.
Reply With Quote
  #8  
Old 02-03-2025, 15:25
blue_devil's Avatar
blue_devil blue_devil is offline
Family
 
Join Date: Dec 2011
Location: Observable Universe
Posts: 437
Rept. Given: 93
Rept. Rcvd 60 Times in 33 Posts
Thanks Given: 474
Thanks Rcvd at 703 Times in 229 Posts
blue_devil Reputation: 60
I only can prompt twice, then deepseek says,
Quote:
The server is busy. Please try again later.
Reply With Quote
  #9  
Old 02-20-2025, 12:55
Fyyre's Avatar
Fyyre Fyyre is offline
Fyyre
 
Join Date: Dec 2009
Location: 0°N 0°E / 0°N 0°E / 0; 0
Posts: 279
Rept. Given: 90
Rept. Rcvd 87 Times in 40 Posts
Thanks Given: 176
Thanks Rcvd at 350 Times in 120 Posts
Fyyre Reputation: 87
I asked it about Falon Gong last night (locally) - and spent about 15 minutes debating it.

But let's be real.. ? Does anyone except it to behave otherwise--if so, you're kidding yourself and need to stop.

Locally its quite a useful tool... especially if you are playing with some of the more 'schizo' rethinking builds available via huggingface.

Regards,

Fyyre
__________________
Fyyre burnt out. I am the ashes.

--

https://github.com/Fyyre
Reply With Quote
  #10  
Old 02-21-2025, 17:52
0xGhostwire 0xGhostwire is offline
Guest
 
Join Date: Oct 2024
Posts: 3
Rept. Given: 0
Rept. Rcvd 0 Times in 0 Posts
Thanks Given: 6
Thanks Rcvd at 4 Times in 3 Posts
0xGhostwire Reputation: 0
Quote:
Originally Posted by blue_devil View Post
I only can prompt twice, then deepseek says,
I know its a late reply but the issue is still present. The reason is the recent spike in popularity through tons of youtube videos, tik toks etc etc. DeepSeek is under heavy pressure and currently not able to handle the user load. Their API was down for roughly 10 days at the beginning of february aswell.

Besides the normal user load, they currently have to manage, they are currently still under heavy DDOS attacks. Because of that its currently not even possible to top up API Credits

Hopefully they will resolve this soon.

In the meantime if you have an X Account Grok 3 seems to be the new kid on the block and atleast yesterday I was ablo to use it without any subscriptions.

Dont get used to it tho they hiked up the prices (they doubled them) and probably will get their asses kicked for it since nobody I know will pay 40$ a month for access to a Chat LLM if you can have slightly worse for 1/10 of the price.

Best regards
Reply With Quote
The Following User Says Thank You to 0xGhostwire For This Useful Post:
sendersu (02-22-2025)
  #11  
Old 02-22-2025, 12:37
Mendax47's Avatar
Mendax47 Mendax47 is offline
Family
 
Join Date: Jun 2016
Location: Earth..
Posts: 234
Rept. Given: 64
Rept. Rcvd 9 Times in 8 Posts
Thanks Given: 756
Thanks Rcvd at 272 Times in 105 Posts
Mendax47 Reputation: 9
https://github.com/albertan017/LLM4Decompile
Quote:
Originally Posted by blue_devil View Post
Are there any reverse engineering (especially for decompilation), specific models?
Reply With Quote
The Following 4 Users Say Thank You to Mendax47 For This Useful Post:
blue_devil (02-22-2025), chants (02-23-2025), mmx (02-28-2025), uranus64 (02-22-2025)
  #12  
Old 06-11-2025, 14:56
eychei eychei is offline
Friend
 
Join Date: Mar 2018
Posts: 58
Rept. Given: 0
Rept. Rcvd 0 Times in 0 Posts
Thanks Given: 34
Thanks Rcvd at 10 Times in 10 Posts
eychei Reputation: 0
Hi Guys,

there is a nice publication about this here: https://arxiv.org/pdf/2505.19915

Does anyone here know more about this topic and used such agents?


Best regards
Reply With Quote
  #13  
Old 06-11-2025, 16:47
tom324 tom324 is offline
Friend
 
Join Date: Jan 2002
Posts: 233
Rept. Given: 5
Rept. Rcvd 7 Times in 6 Posts
Thanks Given: 24
Thanks Rcvd at 28 Times in 17 Posts
tom324 Reputation: 7
Is there any tutorial on how to fine-tune Deepseek Coder V2?

I am facing dependency hell when trying to setup an environment that can be used for additional training on private C source code.

ollama is good for running it on Windows/WSL, but not for additional training.

Regards,
Tom
Reply With Quote
The Following User Says Thank You to tom324 For This Useful Post:
niculaita (06-12-2025)
  #14  
Old 06-12-2025, 05:50
Shub-Nigurrath's Avatar
Shub-Nigurrath Shub-Nigurrath is offline
VIP
 
Join Date: Mar 2004
Location: Obscure Kadath
Posts: 955
Rept. Given: 67
Rept. Rcvd 420 Times in 95 Posts
Thanks Given: 77
Thanks Rcvd at 371 Times in 114 Posts
Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499
This one is new too

https://huggingface.co/fdtn-ai/Foundation-Sec-8B
__________________
Ŝħůb-Ňìĝùŕřaŧħ ₪)
There are only 10 types of people in the world: Those who understand binary, and those who don't
http://www.accessroot.com
Reply With Quote
The Following User Says Thank You to Shub-Nigurrath For This Useful Post:
Fyyre (06-14-2025)
  #15  
Old 06-12-2025, 06:47
Shub-Nigurrath's Avatar
Shub-Nigurrath Shub-Nigurrath is offline
VIP
 
Join Date: Mar 2004
Location: Obscure Kadath
Posts: 955
Rept. Given: 67
Rept. Rcvd 420 Times in 95 Posts
Thanks Given: 77
Thanks Rcvd at 371 Times in 114 Posts
Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499 Shub-Nigurrath Reputation: 400-499
There are discussions on the Hugging Face page (e.g., fdtn-ai/Foundation-Sec-8B/discussions/10) about running this model with Ollama. Users have attempted it, and there's an Ollama entry for an "abliterated" (uncensored) version huihui_ai/foundation-sec-abliterated:8b
__________________
Ŝħůb-Ňìĝùŕřaŧħ ₪)
There are only 10 types of people in the world: Those who understand binary, and those who don't
http://www.accessroot.com
Reply With Quote
Reply

Tags
deepseek, ollama

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Writing to a running (in-use) executable file omidgl General Discussion 20 11-17-2005 00:54
Running program from memory Spiyre General Discussion 6 09-18-2004 09:34
How can I detect whether a 'Virtual machine' is currently running? me0007 General Discussion 5 06-16-2004 17:44
Need to find a pattern in a running file merlin General Discussion 14 07-20-2002 06:59


All times are GMT +8. The time now is 02:13.


Always Your Best Friend: Aaron, JMI, ahmadmansoor, ZeNiX, chessgod101
( 1998 - 2025 )