32,768 is the number of raw individual instructions, not the compiled size.MaxSt wrote:hq4x32.obj = 300Kb.Clements wrote:Pixel Shader 3.0 allows for 32,768 maximum instructions, while GeForce FX handle up to 512.
MaxSt.
Opengl for zsnes under win.
Moderator: ZSNES Mods
-
- ZSNES Developer
- Posts: 3904
- Joined: Tue Jul 27, 2004 10:54 pm
- Location: Solar powered park bench
- Contact:
Say in that object file each unstruction is 4 bytes which is logical since it's a 32bit object file, say also the object file is wasting 100KB for misc overhead.
(300KB-100KB)/4 is still > 32KB.
(300KB-100KB)/4 is still > 32KB.
May 9 2007 - NSRT 3.4, now with lots of hashing and even more accurate information! Go download it.
_____________
Insane Coding
_____________
Insane Coding
-
- Romhacking God
- Posts: 922
- Joined: Wed Jul 28, 2004 11:27 pm
- Contact:
Ok you pessimistic nugget. We know you hate all new 3D API's and would love to use several year old DirectDraw and blitting forever, but your constant barrage of comments like this is starting to get on my nerves.MaxSt wrote:hq4x32.obj = 300Kb.Clements wrote:Pixel Shader 3.0 allows for 32,768 maximum instructions, while GeForce FX handle up to 512.
MaxSt.
No one was talking about HQ4x. That's not gonna fit in any shader. We know that. What's your point?
[url=http://transcorp.romhacking.net]TransCorp[/url] - Home of the Dual Orb 2, Cho Mahou Tairyku Wozz, and Emerald Dragon SFC/SNES translations.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
-
- Romhacking God
- Posts: 922
- Joined: Wed Jul 28, 2004 11:27 pm
- Contact:
The newest ATI cards STILL do not support PS3.0 and nvidia does. It's not just an GeforceFX sucks deal.Noxious Ninja wrote:The GeForce FX line was pretty bad. nVidia is lucky there weren't many heavy DX9 games yet. Fortunately, the GeFirce 6 series is much better.
[url=http://transcorp.romhacking.net]TransCorp[/url] - Home of the Dual Orb 2, Cho Mahou Tairyku Wozz, and Emerald Dragon SFC/SNES translations.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
-
- Dark Wind
- Posts: 1271
- Joined: Thu Jul 29, 2004 8:58 pm
- Location: Texas
- Contact:
I know. Reading the freedesktop mailing list and some others makes me realize the whole graphics industry sucks. Yes, 1 GB of video RAM really is useful. We should've had PCI-Express years ago. Video drivers should be open-source, or at least be Mesa backends instead of complete OpenGL implementations.Nightcrawler wrote:The newest ATI cards STILL do not support PS3.0 and nvidia does. It's not just an GeforceFX sucks deal.Noxious Ninja wrote:The GeForce FX line was pretty bad. nVidia is lucky there weren't many heavy DX9 games yet. Fortunately, the GeFirce 6 series is much better.
It looks like the Freedesktop guys have got buy-in from the major GPU vendors, so this will hopefully change.
[u][url=http://bash.org/?577451]#577451[/url][/u]
32KB is not a size limit for shaders. 32K is the instruction count, not compiled size. And that's still a compiled binary. With shader programs they are compiled by the video driver, so you can't really tell how large they are compiled.Nach wrote:Say in that object file each unstruction is 4 bytes which is logical since it's a 32bit object file, say also the object file is wasting 100KB for misc overhead.
(300KB-100KB)/4 is still > 32KB.
Of course, if you can free up main CPU time while still having extra filters, isn't that a good thing overall?MaxSt wrote:My point is that hq4x is the only one filter that really clogs the CPU.Nightcrawler wrote:No one was talking about HQ4x. That's not gonna fit in any shader. We know that. What's your point?
Therefore, it's the only filter that make sense to move from CPU to GPU.
MaxSt.
The GeforceFX sucks because it was a poor design, not because it lacked shader 3.0. GeforceFX had crappy register usage, and horrible floating point math capabilities. The latest Radeon chips have great FP math, they just lack shader 3, which isn't a big deal just yet. The fact that nvidia had to create a totally new design for the 6800 while all ATI did for X800 was put 2 Radeon 9800 cores into 1 shows just how bad GeforceFX was, and how good R300 was.Nightcrawler wrote:The newest ATI cards STILL do not support PS3.0 and nvidia does. It's not just an GeforceFX sucks deal.Noxious Ninja wrote:The GeForce FX line was pretty bad. nVidia is lucky there weren't many heavy DX9 games yet. Fortunately, the GeFirce 6 series is much better.
-
- Romhacking God
- Posts: 922
- Joined: Wed Jul 28, 2004 11:27 pm
- Contact:
Ok.. I apologize for being harsh in my last post, but without explanation it sounded pointless.MaxSt wrote:My point is that hq4x is the only one filter that really clogs the CPU.Nightcrawler wrote:No one was talking about HQ4x. That's not gonna fit in any shader. We know that. What's your point?
Therefore, it's the only filter that make sense to move from CPU to GPU.
MaxSt.
I also find HQ2x to cause some noticable performance loss on my computer(Athlon Thunderbird 1.4Ghz/Geforce 2).
I can also notice a small difference with the likes of Super 2XSAI or Super Eagle. However.. people with older computers such as myself are probaly likely NOT to have a video card capable of shaders anyway. If you have a new enough system to have a video card with those capablities, I guess HQ4x would probably be the only filter that would bog you down.
How large is HQ2x anyway?
Despite noticable CPU drains, offloading any filter will speed things up to some degree.
[url=http://transcorp.romhacking.net]TransCorp[/url] - Home of the Dual Orb 2, Cho Mahou Tairyku Wozz, and Emerald Dragon SFC/SNES translations.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
-
- Dark Wind
- Posts: 1271
- Joined: Thu Jul 29, 2004 8:58 pm
- Location: Texas
- Contact:
-
- Romhacking God
- Posts: 922
- Joined: Wed Jul 28, 2004 11:27 pm
- Contact:
Noxious Ninja wrote:Yeah, but your GeForce 2 isn't going to be capable of much offloading. Does it even support shaders at all?
I just said that. My Geforce 2 wouldn't help in this case.Nightcrawler wrote:However.. people with older computers such as myself are probaly likely NOT to have a video card capable of shaders anyway.
[url=http://transcorp.romhacking.net]TransCorp[/url] - Home of the Dual Orb 2, Cho Mahou Tairyku Wozz, and Emerald Dragon SFC/SNES translations.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
-
- ZSNES Shake Shake Prinny
- Posts: 5632
- Joined: Wed Jul 28, 2004 4:15 pm
- Location: PAL50, dood !
Linux debug object files:
-rw------- 1 grin users 182864 Mar 1 20:31 video/hq2x16.o
-rw------- 1 grin users 114800 Mar 1 20:31 video/hq2x32.o
-rw------- 1 grin users 235120 Mar 1 20:31 video/hq3x16.o
-rw------- 1 grin users 144496 Mar 1 20:31 video/hq3x32.o
-rw------- 1 grin users 504992 Mar 1 20:31 video/hq4x16.o
-rw------- 1 grin users 328272 Mar 1 20:31 video/hq4x32.o
Reply to post under this one:
Yeah, I forgot. My obj files are always debug. The release-grade ones are erased upon upx completion.
-rw------- 1 grin users 182864 Mar 1 20:31 video/hq2x16.o
-rw------- 1 grin users 114800 Mar 1 20:31 video/hq2x32.o
-rw------- 1 grin users 235120 Mar 1 20:31 video/hq3x16.o
-rw------- 1 grin users 144496 Mar 1 20:31 video/hq3x32.o
-rw------- 1 grin users 504992 Mar 1 20:31 video/hq4x16.o
-rw------- 1 grin users 328272 Mar 1 20:31 video/hq4x32.o
Reply to post under this one:
Yeah, I forgot. My obj files are always debug. The release-grade ones are erased upon upx completion.
Last edited by grinvader on Wed Mar 02, 2005 11:51 pm, edited 1 time in total.
皆黙って俺について来い!!
Pantheon: Gideon Zhi | CaitSith2 | Nach | kode54
Code: Select all
<jmr> bsnes has the most accurate wiki page but it takes forever to load (or something)
-
- ZSNES Developer
- Posts: 3904
- Joined: Tue Jul 27, 2004 10:54 pm
- Location: Solar powered park bench
- Contact:
You mean Linux object files with debugging enabled which is larger.
Windows and Linux object files:
-rw-r--r-- 1 nach nach 161088 2005-03-03 00:49 video/hq2x16.o
-rw-r--r-- 1 nach nach 175232 2005-02-08 04:35 video/hq2x16.obj
-rw-r--r-- 1 nach nach 88640 2005-03-03 00:49 video/hq2x32.o
-rw-r--r-- 1 nach nach 96220 2005-02-08 04:35 video/hq2x32.obj
-rw-r--r-- 1 nach nach 200704 2005-03-03 00:49 video/hq3x16.o
-rw-r--r-- 1 nach nach 217786 2005-02-08 04:35 video/hq3x16.obj
-rw-r--r-- 1 nach nach 103712 2005-03-03 00:49 video/hq3x32.o
-rw-r--r-- 1 nach nach 112102 2005-02-08 04:35 video/hq3x32.obj
-rw-r--r-- 1 nach nach 448640 2005-03-03 00:49 video/hq4x16.o
-rw-r--r-- 1 nach nach 493163 2005-02-08 04:35 video/hq4x16.obj
-rw-r--r-- 1 nach nach 267488 2005-03-03 00:49 video/hq4x32.o
-rw-r--r-- 1 nach nach 294695 2005-02-08 04:35 video/hq4x32.obj
Windows and Linux object files:
-rw-r--r-- 1 nach nach 161088 2005-03-03 00:49 video/hq2x16.o
-rw-r--r-- 1 nach nach 175232 2005-02-08 04:35 video/hq2x16.obj
-rw-r--r-- 1 nach nach 88640 2005-03-03 00:49 video/hq2x32.o
-rw-r--r-- 1 nach nach 96220 2005-02-08 04:35 video/hq2x32.obj
-rw-r--r-- 1 nach nach 200704 2005-03-03 00:49 video/hq3x16.o
-rw-r--r-- 1 nach nach 217786 2005-02-08 04:35 video/hq3x16.obj
-rw-r--r-- 1 nach nach 103712 2005-03-03 00:49 video/hq3x32.o
-rw-r--r-- 1 nach nach 112102 2005-02-08 04:35 video/hq3x32.obj
-rw-r--r-- 1 nach nach 448640 2005-03-03 00:49 video/hq4x16.o
-rw-r--r-- 1 nach nach 493163 2005-02-08 04:35 video/hq4x16.obj
-rw-r--r-- 1 nach nach 267488 2005-03-03 00:49 video/hq4x32.o
-rw-r--r-- 1 nach nach 294695 2005-02-08 04:35 video/hq4x32.obj
May 9 2007 - NSRT 3.4, now with lots of hashing and even more accurate information! Go download it.
_____________
Insane Coding
_____________
Insane Coding
It won't be sitting idle though, it will be busy with emulation. Should we change all DirectDraw acceleration to software so that the CPU won't be idling also?MaxSt wrote:I don't see the point. It's CPU's job - to crunch numbers. Why it should sit idle instead?Reznor007 wrote:Of course, if you can free up main CPU time while still having extra filters, isn't that a good thing overall?
hq2x32.obj = 90Kb.Nightcrawler wrote:How large is HQ2x anyway?
MaxSt.
What if you are encoding MP3s or doing some other intensive task at the same time? Just because most CPUs can handle the basic emulation doesn't mean that offloading some of the task to something else has no merit at all. Not that I really know anything about how all this works, but if it is possible to utilise a system more efficiently I don't see why it should be ruled out.
Peace Forever!
-
- Romhacking God
- Posts: 922
- Joined: Wed Jul 28, 2004 11:27 pm
- Contact:
I agree. Just because we have a powerful machines, does that make it so we should all code inefficiently? I don't think so. If an obvious opportunity arises to free some CPU time.. why not take it?
You're mentality is basically saying who cares about code optimization if the CPU can already runs it full speed. I'd say most programmers would not share your views there.
You're mentality is basically saying who cares about code optimization if the CPU can already runs it full speed. I'd say most programmers would not share your views there.
[url=http://transcorp.romhacking.net]TransCorp[/url] - Home of the Dual Orb 2, Cho Mahou Tairyku Wozz, and Emerald Dragon SFC/SNES translations.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
[url=http://www.romhacking.net]ROMhacking.net[/url] - The central hub of the ROM hacking community.
Re: Opengl for zsnes under win.
Thank you so much. Greatly looking forward to this in the future.pagefault wrote: Back on topic, thanks I will merge your changes into the latest CVS.
You didn't specify SNES emulation though, but that's beside the point. Should we all run software sound cards, video cards, etc simply because the CPU can do the work also? What if I feel like compiling a new MAME build while I play ZSNES?MaxSt wrote:Run ZSNES on P2-400 without gfx enhancements.Reznor007 wrote:Run Gauntlet Legends in MAME then come back and say emulation is not a big task.
Then say how P4-3GHz will be terribly busy with ZSNES emulation.
MaxSt.
The way I see it, if you can offload some work onto a separate device, and devote more CPU time to improving emulation then it's an easy answer.
I'd say it's your problem.Reznor007 wrote:What if I feel like compiling a new MAME build while I play ZSNES?
Emulation won't be improved automatically just because you'll throw more CPU cycles at it. After you achieved 60/60, what else CPU is suppose to do?Reznor007 wrote:The way I see it, if you can offload some work onto a separate device, and devote more CPU time to improving emulation then it's an easy answer.
MaxSt.